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Directors’ Note 


The 1969 Scottish Universities Summer School in Physics was the tenth, in 
what is now a well-established annual series. The formal lecture programme 
is fully presented in this volume, and in addition we have included a selection 
from the seminars in which participants in the school—students and staff— 
spoke about their own work and interests. Three visitors to the school— 
Professors A. Kastler, L. Mandel and A. L. Schawlow—also gave seminars 
which were enthusiastically appreciated, but it has not been practicable to 
include them here. 

It was a particular source of pleasure to all of us at this 10th School that 
Professor Kastler, accompanied by Madame Kastler, spent ten days with us, 
sharing in all our activities, academic, recreative and social. 

The devoted work of the editors, Susan Kay and Arthur Maitland, is at- 
tested by this volume. We are glad to be able to acknowledge here the great 
contribution made to the School by the Secretary, Dr. A. G. A. Rae, the 
Treasurer, Dr. A. L. S. Smith, and the other members of our Committee. 


P. S. Farago 


Joint Directors. 
R. M. Sillitto 


Editors’ Note 


Each of the chapters in this book covers the material delivered by the 
lecturers at the Summer School and has been largely prepared by them with 
the minimum of editing. The chapter by Professor Glauber has been prepared 
by Mr. A. Maitland from tape recordings together with notes taken by Dr. 
D. F. Walls and Mr. S. S. Rangnekar. Professor Glauber kindly gave his per- 
mission for a verbatim transcript to be taken in an attempt to bring the 
tutorial atmosphere and spirit of a summer school to the more usual cold 
print of published proceedings. The lectures given by Professor Louisell 
closely followed those given at the 1967 Enrico Fermi Summer School, Course 
42. Dr. Pike’s chapter is based substantially on a review presented at the 
inaugural meeting of the European Physical Society. The articles by Haken 
et al., Graham, Walls, Haake, Pefina and Bénard were presented as seminars. 
Some of these have been revised by the Editors. 

The following lectures were also given at the School, but have not been in- 
cluded in the Proceedings. Professor A. Kastler delivered two lectures on 
“Optical Pumping”. Professor A. L. Schawlow gave a seminar on “The 
Laser — Present and Future”, and Professor L. Mandel presented a paper on 
“The Interference of Photon Beams at Low Light Intensities’’. 

Whilst most of the authors have seen their manuscripts in proof, the Editors 
accept responsibility for anv errors and misprints. 


A.M. 
August 1970 S.M.K. 
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Introductory Remarks 
R. M. SILLITTO 


Quantum optics began when quantum physics began—or perhaps slightly 
before, since quantum optics initiated quantum physics. The investigation of 
the black-body spectral distribution, and its interpretation with the help of the 
statistical mechanics of radiation, led to the hypothesis of quantization, while 
Einstein’s interpretation of the photoelectric effect introduced the notion of 
the photon. The correlation of the statistical and spectral characteristics of 
radiation recurs again and again as one of the central tasks of quantum optics, 
and the statistics of photo-emission are primary data for the discussion of the 
statistical and coherence properties of light-fields. 

But quantum optics, as we know it, hardly existed until the 1950s—though 
the lack of it had struck some people as odd. Glauber summed this up at 
Rochester in 1966 when he remarked that quantum theory owed a lot to 
optics, but had only recently begun to repay the debt. At the beginning of this 
volume on Quantum Optics, it is worth while giving a little space to a consi- 
deration of this curious historical phenomenon. 

Physical optics has, in many ways, been remarkably unperturbed by the 
two major achievements of 20th century physics. Maxwell’s equations, extra- 
polated into a frequency domain far removed from that which provides their 
experimental basis, are the foundation of theoretical physical optics. For very 
profound reasons concerned with the intimate relationship between the electric 
and the magnetic field, these equations were already Lorentz invariant, and so 
survived the advent of special relativity. And the extrapolation to frequencies 
10'° or more times those of Faraday’s experiments on electromagnetic induc- 
tion proved adequate in all respects save one. That one deficiency was certainly 
important—it needed the hypothesis of quantization to remedy it. But this 
necessity appears in optics only when we are concerned to discuss the proces- 
ses of emission and absorption, and such related processes as dispersion. By 
relegating the details of these discussions to atomic physics, physical optics 
remained serenely above the battle. 

In large part, the reliance which those who developed quantum theory 
could place on optical evidence stemmed from the apparent inviolability of 
physical optics. Attempts to find quantum effects in the physical—optical fields 
of diffraction and interference—by Taylor“? in 1909, by Dempster and 
Batho™ in 1927, and by Janossy and Naray® in 1957—were convincingly 
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negative,t and served two distinct and important purposes: firstly, they con- 
firmed the reliability at all intensity levels of the interferometric evidence on 
which optical spectroscopy depends; and secondly, they provided the experi- 
mental basis for the interpretation of the wave-function as giving probabilities 
relating to a single particle, rather than describing directly the probability- 
density of particles. It is worth remarking here that interference experiments 
prior to 1960 were always carried out at “low intensities”, in the sense that the 
degeneracy parameter of the radiation employed—that is, the number of 
quanta per coherence volume—was always less than unity by two or more 
orders of magnitude. Lacking this critical measure of intensity the experi- 
menters mentioned above used other criteria for deciding how low the inten- 
sity had to be for their experiments to be significant—for instance, that the 
average energy within their interferometers at any time should be less than a 
quantum. In retrospect this requirement seems unnecessarily severe, since the 
crucial question is whether the degeneracy parameter is, or is not, much less 
than unity. 

Quantum effects in optical processes were sought in other ways, and 
Campbell”, with rare perception, attempted an intensity-correlation experi- 
ment in 1909. The experiment was reported in two fascinating papers, the first 
of which began: “This is an account of an experiment which failed. ..”; and 
the second reported that, “The experiment has continued unsuccessful.” 
Campbell was not the last person to be unsuccessful with an intensity-correla- 
tion experiment, although Bay and Farago™ have pointed out that some of 
the photon-coincidence experiments which produced null-results were attempts 
to detect an effect linear in the intensity: such an effect, if it existed, would not 
be an intensity correlation. 

As long as optical detectors were long-time-constant devices—and “long- 
time’ here means so long, that many quanta are detected within the response 
time—quantization does not need to be considered, though the broader 
statistical characteristics of the radiation do. So coherence theory was devel- 
oped to take into account the statistical and spectral features of the light emit- 
ted by the high-frequency noise-generators which optical experimenters used 
as light sources. The random process of a light-beam was assumed to be 
Gaussian. The justification for this assumption was not complete, but once 
the assumption was made it allowed attention to be concentrated on the 
second moment of the probability distribution of the light-field. The measure- 
ments of what was called the second-order coherence function (and is some- 
times now called the first-order coherence function) determined the single 
parameter of the Gaussian distribution. In the years after the war, Wolf, 


+ A reported positive result by Dontsov and Baz (1967)(4) cannot yet be regarded as 
established. 
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Hopkins and others developed the classical coherence theory—which had ori- 
ginated as early as 1855‘”—into a theory of physical optics in terms of observ- 
ables*); the unobservable time-variations which underlie elementary wave- 
optical theory had been eliminated. The resemblance between coherence theory 
in this form, and the theory of random signals in radio engineering and com- 
munications theory, now became clear and fruitful. This merging of languages 
and modes of thought has had important consequences in radio and optical 
astronomy, in studies of optical imaging, and in the developing technology of 
information storage and retrieval. 

The development of coherence theory to the point where it became a 
classical theory of optics in terms of observables has, in retrospect, a two-fold 
significance for quantum optics. Firstly, it prepared the ground for the intro- 
duction of a quantum theory, which has been developed since 1963 in the work 
of Glauber and others; and secondly, it established a great domain of optical 
theory which must survive any quantum reappraisal. 

In a sense, this latter point seemed to have been anticipated ina very general 
way by Heisenberg in 1930, when he established “that the classical wave 
theory is sufficient for the discussion of all questions of coherence and inter- 
ference”’. But this dictum seems to have been forgotten—or at least overlooked 
when the experiments of Forrester, Gudmundsen and J ohnson, and of 
Hanbury-Brown and Twiss, aroused controversy and challenged explanation. 
In classical terms, the results of both these experiments were easily explained— 
except, of course, that the use of photoelectric detectors was vital to both 
experiments, and the explanation of photoelectric emission must be quantal. 
Both experiments exhibited the intensity correlation phenomenon Campbell 
had sought almost fifty years before—in the one case correlations in time, and 
in the other case correlations in space. The argument that this violated Dirac’s 
dictum, that a photon “interferes only with itself”’, arose from a too naive idea 
of what is implied by the term “photon”, and has been very effectively ans- 
wered by, amongst others, Hanbury-Brown and Twiss, and Mandel. What 
emerges very clearly from a reading of the papers written while the debate was 
still keen, is the necessity—in an era of photomultiplier tubes and fast elec- 
tronic circuitry—for a theory which allows the discussion of optical coherence 
and of the interaction of matter and radiation to be carried on within a single 
formalism. The separation of physical optics and of atomic physics could no 
longer be maintained, and a quantum theory of optical coherence was 
urgently needed. 

Meanwhile other strands of the fabric of quantum optics were being spun. 
Microwave engineering, statistical mechanics, and the quantum theory of 
matter came together in the invention of the maser in 1954, and quantum 
electronics began to emerge as a recognizable discipline. The negative resist- 
ance characteristic, which is necessary for the possibility of oscillation, is 
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achieved by the production of a population inversion in the active medium, 
and quantum electronics has been defined as the study and exploitation of 
population inversion. The induced and spontaneous emissions from the upper 
maser level provide the signal and the inherent noise respectively in the maser, 
and the term “quantum noise” was coined to refer to spontaneous emission in 
this context. The importance of the maser was at once seen to be its extremely 
good signal: noise capability. Simple considerations on the transition proba- 
bilities for spontaneous and induced emission as functions of frequency show- 
ed that the inherent signal :noise ratio of maser devices at optical frequencies 
might be much less favourable than in the microwave region, and it was six 
years before the successful operation of the laser laid the doubts to rest. 

The successful operation of the laser in 1960 brought a new series of prob- 
lems; one set, concerned with the nature of laser light, and the features which 
distinguish it from “‘ordinary” or thermal light, the other set, concerned with 
the way the laser itself works. The two sets of problems are obviously related, 
since, on the one hand, the peculiar features of the output light are conse- 
quences of the way the device works, and, on the other hand, experimental 
investigations of the statistics of the photoelectron emission produced by 
laser light furnish data for testing theories of the mechanism of the laser. It 
turns out that a classical model derived from the Van der Pol treatment of the 
triode oscillator provides a useful phenomenological theory of the laser as a 
non-linear oscillator driven by noise—the spontaneous emission from the 
upper laser level. Both intensity-noise and phase-noise are predicted by this 
model, in terms of a very small number of model parameters, and the model 
provides a useful basis for the discussion of experimental results on the photon 
statistics of laser beams. The detailed description of the working of the laser, 
in terms of the processes operating at the microscopic level, is a difficult exer- 
cise in the quantum mechanical non-equilibrium statistical thermodynamics 
of an open system.It has been approached in stages—firstly, using quantum 
mechanics to describe the atoms in the laser-active medium, and classical 
electromagnetic theory to describe the radiation field; then, by adding to this 
description a random polarization to describe, phenomenologically, the effects 
of spontaneous emission; and finally, using the quantized field theory and the 
Fokker—Planck equation to describe the build-up of the radiation field. This 
work is able to account for the observed intensity fluctuations in lasers operat- 
ing both below and above threshold, and for the observed variation of band- 
width with pump power; in particular, the relative contributions of phase- 
noise and intensity-noise can be understood. A very different treatment in 
terms of rate-equations accounts satisfactorily for the observations on inten- 
sity fluctuations, but does not include the effects of phase-diffusion. 

The role of the phenomenon of “‘super-radiance”’ in laser operation has 
recently attracted attention. The extension of the Weisskopf—Wigner theory of 
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natural line-width, to systems of many atoms in the same excited state, pre- 
dicts emission phenomena which, depending on the density and configuration 
of the excited ensemble, may display some of the features characteristic of 
laser emission, or may in other circumstances exhibit the characteristics of the 
radiation-trapping which is observed in resonance-fluorescence. In the one, 
super-radiant, case the radiative lifetime of the ensemble is much less than that 
of a single excited atom, while in the other, sub-radiant, case the lifetime may 
be greatly increased. The terms “induced emission” and “spontaneous emis- 
sion” do not seem to be appropriate any longer in discussing the radiative 
properties of such ensembles. The relationship between this approach to laser 
theory and the approach using the Fokker—Planck equation raises a number 
of interesting and fundamental questions. 

The Fokker—Planck equation treatment, which has now been very fully 
worked out, leads to detailed predictions about the build-up in time of the 
laser-field, the intensity fluctuations and line-width at different power levels, 
and—as a corollary of this—the coherence properties of the emitted light. 
Haken has pointed out that the laser light-field “appears as a carrier of the 
quantum fluctuations of the atoms, which (fluctuations) are filtered and trans- 
formed by a non-linear process”. The study of these intensity fluctuations, or, 
equivalently, of the photon-statistics of the light-field, is thus a test of the 
microscopic theory and, besides, gives an insight into the nature of the light- 
field. To describe the properties of such fields, the theory of coherence, as it 
evolved in relation to the Gaussian light-fields of thermal light sources, has 
had to be extended and refined, additional statistical parameters being related 
to a hierarchy of coherence functions defined in slightly different ways by 
different authors. The results of multidetector correlation experiments can 
be expressed in terms of these coherence functions and by a suitable ordering 
of creation and annihilation operators one can construct appropriate cor- 
relation functions to describe multiple coincidence rates for either photo- 
detectors or stimulated -emission photon counters. As yet, little experimental 
work has been done on the higher-order correlations which are revealed 
either by triple- and higher-order coincidence experiments, or by photon- 
statistics experiments, but enough has been done to show that there is signifi- 
cant new information to be obtained in this way.“° Meanwhile, intensity 
correlation experiments, in which photon-coincidences are observed or photon- 
statistical distributions are determined, have provided the means of measuring 
optical line-widths orders of magnitude narrower than could be measured by 
optical interferometry. 

Even before the advent of the laser, very long Fabry-Perot etalons were 
constructed for spectrometry, and for use as very narrow-band optical filters. 
For instance, a 156 cm etalon was in use at NBS in the late 1950s for measure- 
ments of the spectral line-width of an atomic beam light source, and Connes 
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used an etalon 260 cm long as a filter to produce a very narrow spectral line 
for experiments on light-modulation, in about 1960. The use of Fabry—Perot 
systems as laser resonant cavities has sparked extremely interesting and 
fruitful investigations of the physical optics of the Fabry-Perot resonator, 
both when it functions passively—as interferometer or filter—and when it 
functions actively, with excited matter in the cavity. Computer solutions of 
the electromagnetic field equations in the cavity have revealed subtle comp- 
lexities in its behaviour which were previously unsuspected, and both the 
parallel-plate system and a variety of confocal and non-confocal systems 
using mirrors or lenses have been studied; some of these systems have un- 
familiar forms (see, e.g. Toraldo di Francia’s Chapter in this volume). The 
mathematics of these studies is elegant in itself, and in practical terms the 
development of the confocal resonator system and its derivatives has a lot to 
do with the ruggedness and reliability of presently available gas lasers. The 
discussion of the properties of cavities containing active material with a comp- 
lex dielectric constant is, of course, one of the possible avenues of approach 
to laser dynamics. 

While the maser was being developed as a low-noise microwave amplifier, 
microwave engineers and physicists were showing a revived interest in the 
potentialities of an alternative device, the parametric amplifier. The principle 
of the energy-conversion mechanism of the parametric amplifier was illu- 
strated by phenomena observed by Faraday,‘'” and Melde,“”) and was ana- 
lyzed and systematically experimented with by Lord Rayleigh.“* It was used 
in a radio-telephone system between Berlin and Vienna before 1915, and Lee 
de Forest, the inventor of the triode, remarked in 1916 that, at that time, the 
“magnetic amplifier’, as the then-current form of parametric amplifier was 
called, had advantages over the triode at high power levels. However, he 
predicted that the development of high-power triodes might soon change this 
situation, and in this he was right. Interest in the parametric amplifier then 
waned, although a number of communications theorists, among them R.V.L. 
Hartley, continued to study the principle, and in 1928 Hartley suggested, at a 
meeting of the American Physical Society, that the Raman effect could be 
understood in terms of the same general coupling and frequency-mixing 
mechanism as the parametric amplifier.“*) The rates of energy transfer in 
these systems are described by a set of very general relations, due to Manley 
and Rowe, which seem to apply wherever the coupling device or medium is a 
non-linear reactance. 

The theory of quantum mechanical oscillators and attenuators, which 
originated in the 1950s, can be used to discuss such systems as an idealized 
maser, or a parametric amplifier, or frequency converter“ and it has not 
always been clear where the dividing line between these devices should be 
drawn—or, indeed, if such a dividing line is necessary or useful. The ele- 
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mentary process involved in the optical parametric amplifier, oscillator, or 
converter, is sometimes referred to as a three-boson process, although the 
term should not be taken too literally. Parametric processes may be phase- 
dependent, and this means that the fields involved should be represented by 
coherent states—or, if the intensities are high enough, by classical waves—and 
these are states in which the numbers of quanta are ill-defined. However, the 
term is useful as far as it helps in the classification of the various phenomena 
which may appear; thus, the annihilation of a photon, accompanied by the 
creation of a photon of different frequency or polarization and a phonon, is 
characteristic of the stimulated Raman or Brillouin effect, while the annihila- 
tion of one photon and the creation of two photons is the elementary mechan- 
ism of the optical parametric oscillator or amplifier. The latter has the inter- 
esting and potentially useful property of being tunable over a quite consider- 
able wavelength range—tuning ranges of up to 3:1 in wavelength, and output 
powers up to 1kW, have been reported. © 

In these devices, the parameter which couples the oscillating fields of differ- 
ent frequencies is a field-dependent polarizability,"” so the mechanisms in- 
volved lie within the domain of non-linear optics. The advent of the laser, with 
its very high field strengths, has made it possible to exhibit and study a great 
variety of very striking effects in solids, liquids, gases and plasmas. In solids, 
these effects provide the solid-state physicist with a powerful armoury for the 
study of elastic behaviour and its structure-dependence. In particular, the 
crystal structure changes at phase transitions modify the non-linear optical 
behaviour. The theory of non-linear effects in solids has been very highly 
developed to treat interactions with phonons, magnons, polaritons and other 
excitations. The behaviour of sound waves in liquids, critical point pheno- 
mena, and molecular excitation and vibration processes have been studied, 
along with various electrical and electromagnetic phenomena such as dielec- 
tric breakdown, and multiphoton excitation. Some of the optical effects are 
very beautiful in their own right—the ring patterns produced in induced 
Raman scattering, for instance, and the very striking phase-matching process 
which Giordmaine and Terhune discovered in negative uniaxial crystals. To 
the experimenter at least, the evident beauty of so many optical phenomena 
has always been one of the appealing aspects of optics, and these effects are 
surely in this tradition. 

It is often the case that new tools and techniques are vigorously applied and 
exploited even while their structure and mode of operation are still being 
studied. This happens here: the laser is used to generate non-linear optical 
processes, and thus we learn to understand better the effect of the non-linear 
properties of the active material in the laser cavity itself. 

In many respects, the elementary processes which are important in quantum 
optics are exhibited most directly in optical pumping experiments. Here, inter- 
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actions are produced between “free” atoms and one or more oscillatory 
fields, so that, for instance, one- or two-photon transitions can be studied 
without introducing the complexities of condensed matter. But even in tenu- 
ous vapours the “free” atom is a fiction—all the atoms present are coupled 
through their interaction with the radiation field, and this coupling provides a 
rich harvest of effects for study and elucidation. 

The term “pompage optique’”’ was introduced in 1950 by Kastler to des- 
cribe a particular method of producing a population inversion, but there are 
affinities between modern optical pumping experiments and experiments on 
the magnetic depolarization of resonance radiation by Wood and Ellett“® and 
by Hanle“!” in 1923, and the subsequent experiments on the effect of radio- 
frequency fields on the polarization of resonance radiation by Fermi and 
Rasetti?™ and by Breit and Ellett.??» The Hanle experiment is a special case of 
a level-crossing experiment, exhibiting optical effects arising from the coherent 
superposition of excited quantum states. In the “coherent trapping” process 
this quantum mechanical—or ‘“‘Hertzian”—coherence is transferred by the 
radiation field from one atom to another or others. It is not obvious that the 
radiative interaction should be the only one which can transfer Hertzian co- 
herence between atoms, and there are various experiments which show that it 
can be transferred in collisions.?” 

There is an idication here of the very intimate connection between optical 
coherence and atomic properties. This connection obviously underlines the 
mechanism of coherent emission in the laser, and in optical parametric devices, 
but the optical pumping experiments seem to show it in its most elementary 
form. Considerable interest attaches to further studies of these effects, there- 
fore, and to theoretical work such as that of Dicke,‘** Barrat’* and Ernst 
and Stehle,‘*>) which relates them to the process of coherent emission for 
more complex ensembles. 

The past fifteen or twenty years has seen a veritable renaissance in optics. 
This has happened before. For instance, in 1900 Drude wrote that in optics 
“there pulses a new life whose further nourishing must be inviting to every 
one’’. In 1956, Zernike said at a meeting in Boston that, activity in optics pro- 
ceeds in waves, and that at that time, optics was emerging from a trough and 
heading for a peak. In large part the enormous wave of activity on whose 
crest we now seem to be riding has been stimulated by the merging of a rich 
variety of streams of thought and technique. It is the purpose of this volume 
to present articles which will clarify in the readers’ minds the nature and inter- 
relations of some of the major influences in this creatively non-linear process. 
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Quantum Electrodynamics 


T. W. B. KIBBLE 


Quantum electrodynamics is the theory which describes the interaction of 
electrons (either free or bound in atoms) with the electromagnetic field, or 
equivalently with photons. In the complete theory, both the electrons and 
the field are treated quantum mechanically, but for some purposes a classical 
treatment of one or the other may suffice. In what is generally called semi- 
classical theory, the electrons are treated quantum mechanically, but the 
field is regarded as an externally prescribed classical field. However, in certain 
problems we may reverse this procedure and treat the field quantum mechani- 
cally, while replacing the electrons by a classical current distribution, exter- 
nally prescribed. 

We shall begin by reviewing briefly the Hamiltonian formalism in both 
classical and quantum mechanics, using the harmonic oscillator as an example. 
Then we shall go on to deal with semiclassical theory, applied particularly 
to induced emission and absorption of radiation by atoms. Next, we shall 
discuss the quantization of the electromagnetic field, and its interaction with 
a classical current distribution. Finally, we turn to the full theory of quantum 
electrodynamics. 


1. HAMILTONIAN FORMALISM 


1.1 CLASSICAL MECHANICS 

The time development of a classical system whose instantaneous state is 
specified by the generalized coordinates and generalized momenta q,, ..., dn 
and py, ..., p, is described by Hamilton’s equations 


dg; _ 0H dp; OH 


= pee Fs 1 
ap ar ae n 


where H is the Hamiltonian function, a function of the coordinates and 
momenta, and possibly also explicitly of the time. Since the coordinates and 
momenta are themselves functions of time, the rate of change of H is 


dH 3 ie dg; , oH ae oH 
dt 0q; dt Gp; dt 
il 
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since by (1) the terms in the sum cancel. If H is not explicitly time-dependent, 
then it is a constant of the motion, whose value is the total energy. The ap- 
pearance of an explicit time-dependence means that the system is subjected to 
external forces. 

For example, a forced harmonic oscillator (in one dimension) is described 
by the Hamiltonian function 


eee a + 4mw*q? — q f(t), (3) 


where f(t) is a given function of time. Hamilton’s equations are 


dq OH p dp 0H | 5 
dt tm de age @) 


It is convenient to introduce in place of g and p a complex variable 
z= moog + ip. (5) 
It satisfies the equation of motion 
dz/dt = —iwz+i f(t). (6) 


If there is no external force, then the motion corresponds to a circle, described 
with constant angular velocity —w, in the complex z plane (which may es- 
sentially be identified with the two-dimensional classical phase space). More 
generally, the solution of (6) is 


z(t) = Zo exp (—ion) +i | dt’ f(t’) exp [—im(t—t’)], (7) 


where Z, is an arbitrary complex constant. In particular, if f vanishes except 
during some finite time interval tp <<¢<t,, then 


z(t) = z9exp(—iot) for t<ty 
and 
z(t) = z,exp(—iot) for ¢t>4t,, 


where the constants z, and z, are related by 
ti 
Z4—-Zy =i | dt f(t) exp Got) = if(a). (8) 
to 


When f = 0, H is constant, and equal to |z|?/2m. 
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1.2 REVIEW OF QUANTUM MECHANICS 


In quantum mechanics, the coordinates and momenta are replaced by opera- 
tors 4,,..., 4, and p,,..., p, which satisfy the commutation relations 


[4:. | = [B;, p,] = 0, [4:, pj] = ind; ;, (9) 
where [A, B] = AB-— BA, and where 6,; is the Kronecker delta symbol defined 
by : ifi=j, 

Flo if i# 7. 


In one particular representation (the g representation), these operators act on 
functions w(q;, ... ,g,), according to the relations 


av = ay, pi = — hy. (10) 


We prefer however not to choose a particular representation, but to use the 
more abstract Dirac notation, in which the states are represented by state- 
vectors ]y> on which the operators act. There is defined a scalar product 
<$|> between any two such state-vectors, satisfying <¢|y~>* = <w|d>, where 
the star denotes complex conjugation. (In the g representation, the scalar pro- 
duct is the integral over all g; of *w.) Two state-vectors |¢> and c|¢@> which 
differ only by an overall multiplicative factor describe the same state. It is 
possible to normalize the state-vectors so that (¢|¢> = 1, but even then there 
remains an arbitrary phase factor in |). 
For any operator A, we define the Hermitian conjugate A* by 


<PlA*l> = <WIAlo>*. 
If Alp> = [x>, then for any |), 


<OlA*> = WIAIb>* = WhO* = Kal, 


so that we may write <¢|A* = <y|. Observables correspond to Hermitian 
operators, such that A* = A. For example, the operators defined by (10) are 
Hermitian, as may be proved by partial integration. 
Every observable possesses a complete set of eigenvalues a and eigenvectors 
la> such that 
Ala> = ala). 


It is easy to prove that the eigenvalues a must be real, and that eigenvectors 
corresponding to different eigenvalues must be orthogonal. If the eigenvalues 
form a discrete set, then the corresponding eigenvectors can be normalized, 
so that 


<ala’»> = Sa" (1 1) 
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The set of eigenvectors is complete in the sense that any vector |y> may be 
expanded as a linear combination of them, 


> = Via ve 
By (11), <al> = w,, so that we may also write the expansion in the form 


> = 2 lad<aly> 


or, symbolically, 


1 = Sla><al. (12) 


Physically, <a|y> is related to the probability that the result of a measurement 


of A in the state |W) is a. In fact, this probability is |<afw >|? if fy> is normali- 
ized. The expectation value of A in the state |W) is 


a = Yial<aly>l? = CiAlp>. 


The uncertainty in the result of the measurement is characterized by the vari- 
ance 
Aa? = (a—@)” = (a*)—(a)’. 
If the set of eigenvalues is continuous, then instead of a sum in (12) we 
must have an integral, 


i= | da la> <al. 
Correspondingly, the eigenvectors must be normalized according to 
<ala’> = d(a—a’), 
where 6 is Dirac’s delta function (mathematically a distribution), defined by 


[ da’ 6(a—a’) f(a’) = f(a) 


for any continuous function f. In this case, the individual eigenvectors are not 
normalizable, and do not really belong to the space of state-vectors. They do 
not correspond to real physical states, except in an idealized limiting sense. 
In this case, |<a|y>|*da is the probability that the result of the measurement of 
A lies between a and a+da. 

The time development of a quantum-mechanical system is again governed 
by the Hamiltonian function, which is now an operator A. It is described by 
Schrédinger’s equation 


ih + lw, t> = Alw,t. (13) 
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If |E> is an eigenvector of H, statisfying 
A\E> = E|E), 
then a particular solution of (13) is 
IW, £> = [E> exp (—iEt/h). 


In this case, the state-vector depends on time only through a phase factor, so 
that the physical state does not change with time. Since any state-vector can 
be expanded in terms of eigenvectors of Af, the general solution of (13) is a 
linear superposition of these stationary solutions. 


Wt) = exp (—iAt/hWY, 0) = Yt E> exp (—iEt/h) CEW.0), (14) 


where denotes a sum or integral, a sum over discrete eigenvalues and an 


integral over continuous eigenvalues. 


1.3. THE HARMONIC OSCILLATOR 


The quantum mechanical harmonic oscillator is described by operators @ 
and # satisfying the canonical commutation relation, 


[4, p] = ih. (15) 


The Hamiltonian operator is (with no forcing term) 
J 
A= om p? +4mw’9?. (16) 


To find the eigenvalues of this operator, it is convenient to introduce a 
non-Hermitian operator @ analogous to the classical complex variable z, but 
differently normalized, defined by 


_ mog+ip 
~ (Qhma)* * ce 


From (15) it is easy to obtain the commutation relation 
[a, a*] = 1. (18) 


Substituting the inverse relations to (17), 


a=(~-) ara, p= 


4] es 
2mo ) po ) a) 
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into (16), we obtain 
A= tho(aa* + a*a) = hw(a*a+4) = (N+ ho, (20) 


where N = @*@. 
Thus it suffices to find the eigenvalues of N. If |n) is an eigenvector of N, 
satisfying 


N|n> = nln), (21) 
then also 
A\n> = E,\ny, 
where 
E, = (n+4) ho. (22) 


Now from (18) we easily find that 


[a, N] = aa*a—a*aa = [4, a*]a = 4, 
and similarly that 


[a*, N] = —a*. 
Thus by (21) 
Naln> = (@N—€)\n> = (n-1)ajn), (23) 
and 
Na*|n> = (@*N-+4*)|n> = (n+1)4*\n). (24) 


Thus 4|n> and 4*|n> are also eigenvectors of N’, wtih eigenvalues n—1 and 
n+ 1, respectively. Starting from any eigenvalue n, we can go up and down a 
ladder of eigenvalues by unit steps. This ladder can only stop if, after some 
point, further application of @ or @* leads to a vector which vanishes. Now the 
norm of @|n> is easily evaluated. Its square is 


<n|a*a|n> = <n|N\n> = n. (25) 


Since this is a necessarily non-negative quantity, none of the eigenvalues can 
be negative. Moreover, it is zero only if n = 0. Thus, in order that the ladder 
should terminate before reaching negative eigenvalues, it must stop atn = 0, 
which means that the eigenvalues must be integers, n = 0, 1, 2, ... . Corres- 
pondingly, by (22), the eigenvalues of energy must be 4ha, sho, Sho, .... 
We have an equally spaced spectrum, with spacing ha. 

From (23) it follows that ajn> must be proportional to the eigenvector 
|z—1), and by (25) the factor of proportionality must be n* multiplied by a 
phase factor, which can be chosen to be unity by a suitable choice of the 
arbitrary phases in the eigenvectors. Thus 


aln> = n?|n—1). (26) 
To satisfy (21) we must then have also 4*|n—1) = n(n), or 


a*|n> = (n+ 1)?|n+1). (27) 
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As we have seen, the lowest eigenvalue E, of the Hamiltonian (16) is not 
zero, but 4hw. This “zero-point energy” can have observable consequences if 
we consider for example a system in which the frequency is variable. However, 
so long as we stick to a single oscillator of fixed frequency, it is merely an 
additive constant in the energy, and therefore unobservable. It will be con- 
venient in what follows to subtract this constant, and consider the system 
defined by the Hamiltonian 


1 
A= oe p? +4mw*9? —tho = hwa*a. (28) 
Its eigenvalues are of course E, = nhw, n = 0,1, 2,.... 


1.4 COHERENT STATES 


We now wish to examine a particular set of states constructed out of the 
energy eigenstates of the harmonic oscillator. They have come to be known as 
“coherent” states, although the name is really more appropriate to the multi- 
mode case considered later than to the present single-mode system. They have 
properties which make them particularly convenient to use in discussions of 
the classical limit of the quantum oscillator. 

For any complex number a, we define the state-vector 


ja) = im exp (Hal). (29) 


For a = 0, this is simply the ground state |0>, but for any other value of «, 
it is a state with some uncertainty in energy. The probability that the energy is 
E,, follows a Poisson distribution, 


2n 
Klay? = 2 exp (— |e) 


The scalar product of two such states is 


fea) siti 


<alB> =) — 


exp (—4a|”)exp(—41B|7) = exp (a*B— 41a” —416|’). (30) 


The states are therefore never orthogonal, although the scalar product is very 
small if «— f is large. 

Now let us operate with the lowering operator @ on a coherent state |«>. 
Using (26), and then replacing 2 by n+ 1 in the sum, we obtain 
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a" 


(n!)? 


ala) = Y) n¥n—1)- = exp (—4lal?) 


_/ 


ym aa exp (—4]a|?) = ala). (31) 


The coherent states are therefore eigenstates of a. (Recall that a is a non- 
Hermitian operator, and so can have complex eigenvalues, and non-ortho- 
gonal eigenvectors.) By conjugation, we also have <a|4* = a*<a]. It follows 
that the expectation value of any normally-ordered product (one in which all 
factors of 4* precede all factors of a) in a coherent state is given by 


Cala*"a"|a> = aq", 
Now also, since 4|0> = 0, we have 
<0|(A* +a*)"(A+a)"|0> = oq" 
More generally, any operator function F(a, a*) oe be written, using the com- 


mutation relation (18), as a sum of normally-ordered products, and it follows 
that 


Cal F(a, a*) Ja> = <O|F(@+a, a*+a*)|0). (32) 


In particular, it follows from (19) that 


2h \+ 
<algja> = (=) Rea, <alpla> = (2hma)? Im a. 


Moreover, since the uncertainties in g and f are unaffected by adding constant 
terms to 4 and 4*, they are the same as in the ground state |0), namely 


h hma@ 
Ag? = —— Ap? = ——. 
q > ip 5 


Thus the product of uncertainties is 
Aq Ap = h/2, 
which is the minimum value it can attain in any state. 


Next, let us examine the time-development of a state which is initially a 
coherent state. If |, 0> = |«>, then from (14) we find 


* 
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IW) =D lnpexp (— inet) 7 ye exp (—Hal?) 


, n 
=F ny BSE ROF exp (HH?) = [exp (—io)>. (33) 
(If we had retained the original Hamiltonian (16) instead of the form (28) with 
the zero-point energy removed, we should have found an additional overall 
phase factor exp (—4iwt), which is of course of no physical significance.) We 
see from (33) that a state which is initially a coherent state is a coherent state 
at all times. The label « follows a circular path in the complex plane, described 
with constant angular velocity w, exactly as in the solution of the classical 
oscillator. The uncertainties in @ and p remain fixed, while their expectation 
values oscillate with the classical angular frequency w. A coherent state is thus 
the quantum state which most closely approximates the motion of a classical 
oscillator. We note that the expectation value of the Hamiltonian (28) is 


CalA|a> = helal?, 


which is exactly equal to the energy of a classical oscillator with the same 
amplitude. (In terms of the original Hamiltonian we would of course have in 
addition the zero-point energy term.) 

A remarkable property of the coherent states is that any operator A is com- 
pletely determined by its expectation values in all coherent states. For by (29), 


xm Nn 


a ae ar a a 
<alAlay exp lal? = Scag Cm), 


so that the left hand side may be regarded as a generating function from which 
all the matrix elements of A between energy eigenstates can be recovered. 

Thus to prove any operator equation it suffices to prove that its expectation 
values in all coherent states are satisfied. We may use this in particular to 
prove a completeness relation for the coherent states themselves. For any co- 
herent state |f> we have, by (30), 


d2 2 
[Bier <aipy = | exp (lap?) = 1 = <B>, 


vi 


where d?a = d(Re «) d(Im «). Thus it follows that 


(= la <a| = 7. (34) 


od 


20 T. W. B. KIBBLE 


The coherent states in fact form an overcomplete family. They span the entire 
space of state-vectors, but are not linearly independent. 


1.5 THE FORCED OSCILLATOR 


The coherent states also play an important role in the quantum mechanical 
treatment of the forced oscillator, a problem which as we shall see later is 
essentially the same as that of an electromagnetic field in the presence of a 
prescribed classical current distribution. 

The forced oscillator is described by the Hamiltonian 


1 h 
Bea) t mw’? —tho—4 f(t) = h@a*a — (= =a) (a+ 4*) f(t). 


We know that in the absence of the forcing terms there are solutions of the 
Schrédinger equation in which the state is always a coherent state, and it is 
natural to ask whether this is still true of the forced oscillator. We therefore 
seek a solution in which |, t> = |a(t)> exp [i6(t)]. Taking the scalar product 
of the Schrédinger equation with an arbitrary fixed coherent state |B>, we 
have 


0 
ih =| <Bla(t)> exp [ip(t)] | = <B|Ala(t)> exp [6]. (35) 


Clearly, it is sufficient (as well as necessary) to ensure that this equation is 
satisfied for arbitrary values of f. 
Now, 


7) a) 
py “Plated = => exp [B*ax(t) —4$1B|? —4]oe(t)|7] 


= <pla(n>| 6S — 45 (aj *), 


and 


h \t 
<ALAIA(D> = <Bla(d)> [hapa (—"—) arb 100). 
Thus, substituting in (35), we obtain 


in| oe — 4-5 (a al?)| - -1£ = hope (=) (+B) £(2). 


This equation is self-consistent inasmuch as both sides are linear functions of 
B* (in consequence of the linearity of the Hamiltonian in 4*). We may equate 


QUANTUM ELECTRODYNAMICS 21 


to zero the coefficients of 6* and 1, and thus obtain two equations for da/dt 
and d@/dt. The first of these is essentially the same as the classical equation 
(6), namely 


a cine 36) 
dt (2hma)* ( 
Using this equation, we can eliminate the term in d(|a|”)/d¢ from the second 
equation, and obtain an equation for d@/d? alone, namely 


d f(t 
a = Eee oo Re a(t). (37) 
These equations can be solved with any given initial conditions. Thus we see 
that if the system is initially in a coherent state, then at any later time it will 
again be in a coherent state. 

In particular, let us suppose, as before, that f vanishes except during some 
finite time interval, tg<¢<t,, and that for t<fo, the system is in its ground 
state |0>. Then the solution of (36) is 


i , ta LY ? 
= a { dt’ f(t’) exp [—io(t—r')], 


to 


as in (7). At any time ¢>1,, the system will be in the state 


[W,t> = |e, exp (—iat)) exp (19), (38) 
where : i ry 
i .,__ _if(@) 
“= Ghimayt i dt f(t) exp (iat) = @Ghmat (39) 
and 


t1 t 
1 , , Py ¢ 
@; = sean | arnt { at f(t’) sin w(t~—t’) 


+0 ~ 
ee. P | dw’ |f(w’)|? 
~ 2hma 2x wa’ 
—-@ 
where P denotes a principal value integral. By symmetry we can also write the 
solution as 

+o ~ 

1 dw’ |f(@’)|? 
* Gm?) On oo” om 
0 
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We see therefore that the effect of applying an external force to a harmonic 
oscillator in its ground state is always to leave it in a coherent state, (38). By a 
suitable choice of the external force f(t), any given coherent state can be 
reached in this way. 


2. CLASSICAL ELECTRODYNAMICS 


2.1 POTENTIALS AND GAUGES 


The basic equations of classical electrodynamics are Maxwell’s equations, 
together with the Lorentz force equation 


moe = e[E(r)+v~a Bir) ], v=— (41) 


which describes the motion of a particle of charge e and mass m in an electro- 
magnetic field. 

In rationalized units, and in the absence of dielectric or magnetic media, 
Maxwell’s equations are 


OB 
V.B=0, VAR = 0 (42) 
and 
cE 
&V.E = p, Ho” * VAB— fo = =j, (43) 


where p andj are the electric charge and current densities. 
From (43) there follows immediately the continuity equation 


op 
—+V.j=0 44 
re j (44) 
which expresses the microscopic conservation of charge. 
From (42) we may infer the existence of a vector potential A and a scalar 
potential ¢ such that 


B=VaAA, E=—~—-—Vé. (45) 
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These potentials are not unique. The fields E and B are unchanged if we make 
any gauge transformation, 


A-A-VWA, 6+, (46) 


where / is any function of space and time, This arbitrariness may be removed 
by imposing a gauge condition. For our purposes, the most convenient choice 
is the radiation gauge, defined by the condition 


V.A=0. (47) 


If we start with any potentials satisfying (45), then we can pass to the radia- 
tion-gauge potentials by making a gauge transformation (46) in which the 
function 4 is chosen to be a solution of V72 = V.A. 

If we substitute for E from (45) into the first of the equations (43), and use 
the radiation-gauge condition (47), we obtain 


pox EoV*o = p- 


if the charge density p vanishes outside some finite region of space (or at 
least tends rapidly to zero as roo), then we can solve this equation explicitly 
in the form 


pa’) 
4ne,\r—r'| 


g(r) = [arr 


We shall write this equation symbolically as 


1 
= -—— 0. 4 


Thus the radiation-gauge scalar potential is just the usual potential of electro- 
statics. 

Using (48) we may eliminate ¢ and obtain an equation for A alone. When 
we substitute from (45) into the second of the equations (43), we obtain a 
term 
1 dp 1 


ae WE ae 


V.j, 


by (44). Hence, using the standard vector identity 


VaA(VAA) = VV.A — V2A, 
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and the radiation-gauge condition (47), we obtain 


7A 7 ; 
207A — Ho 1V2A =j-V 


a Vi =r (49) 
say. The right hand side of this equation is the divergenceless, or “‘transverse’’, 
part ofj, satisfying V .j, = 0. 

When j is identically zero, A satisfies the wave equation, which describes 
waves propagating with velocity c = (€949)?. A particular solution of this 
equation is a plane wave, 


A(r, t) = Re [(Ao exp (ik . r—i@, t)], (50) 


where «, = clk|, and Ay is a constant complex vector satisfying k. Ay = 0. 
The corresponding electric and magnetic fields are given by 


E(, t) = Re [iw, Ap exp (ik . r—i@, t)], 
B(r, t) = Re [ik A Ao exp (ik. r—ia, t)]. 


The general solution is of course a linear superposition of these. 
From Maxwell’s equations (42) and (43) we find 


(51) 


0 
“5 Geo E*+4p9* B?) = E. (uy! VAB—j)— uo 1 B.VAE. 


Using the standard vector identity 
V-(anb) = b.VaAa—a. Vab, (52) 


we may write this in the form 


Ou 
—+V.S= -E.j, 
ar + j (53) 
where 
u = $(6) E+ * B’) (54) 
and 
S = uo 1 EAB. (55) 


The right hand side of (53) is the negative of the rate at which the electric 
field does work on the currents -per unit volume. Thus we may interpret (53) 
as the continuity equation for energy, if we identify u with the energy density 
in the field, and S with the energy flux vector (Poynting vector). In particular, 
the total energy stored in the field is, 


Hive i dr 4(¢9 E2 +197! B). (56) 
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It may be noted that the contribution to (56) which comes from the “‘longi- 
tudinal” part —V@ of E can be cast by partial integration (Green’s theorem) 
into the form 


p®) p(t’) 


: 57 
4neo|r —r’| G1) 


-4 | reg V26 = 4{ Prop =4 | dr{ ar’ 


This is the electrostatic energy corresponding to the charge distribution p. 
We note also that for a plane wave (51) the energy density and energy flux 
vector, averaged over a period of oscillation, are 


(Way =1le,  <Sdav = Ik, (58) 
where the intensity J is given by 


= 4ce9@,7Ao* . Ao. (59) 


2.2 HAMILTONIAN FORM OF EQUATIONS OF MOTION 
We now wish to show that the equation of motion (41) for a charged parti- 
cle may be derived from the Hamiltonian function 


H = 5—[p-eA(e, )P +ep(t,0). (60) 


The first set of Hamilton’s equations is 


dr oH 1 
= — = — = —(p—eA 61 
Vv dt ap (p—eA), (61) 


where we have used the notation 0/dp for the gradient operator with respect 
to the momentum vector p. Since r appears in H only through the potentials 
A and @, the second set is 


dp oH e 
Reto ee aaa - (p—eA)—eVo, 
dt or m a aL a 
which may be written 
2 = e(VA). v—eV¢. (62) 


Now, since A depends on ¢ both explicitly and through its dependence on r, 
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we have 
wen @ 


Hence, differentiating (61) with respect to the time, and using (62) and (63), 
we obtain 
dy dp dA 


OA 
— = —~e—— = e(VA). v—eVd —e-—- — ev. VA 
mT aye ap e(VA) .v—eVgo—e ay ev 


a e(-Vo- S) teva (Vna) 


= e(E+v~B), 


by (45). Thus we have shown that the Hamiltonian (60) correctly reproduces 
the equations of motion (41). 

We note that the first term in H is numerically equal to the kinetic energy 
of the particle, 4my’, while the second term is the electrostatic potential energy. 
This second term is of course included in (57). Thus to obtain the total Hamil- 
tonian function for the system comprising both the electromagnetic field and 
the particles, this term in (60) should be omitted if it is included in (56). 


3. SEMICLASSICAL ELECTRODYNAMICS 


3.1 PERTURBATION THEORY 


Before going on to discuss semiclassical electrodynamics, it will be useful to 
review briefly the elements of quantum-mechanical perturbation theory, in 
its time-dependent form. 

We suppose that the Hamiltonian operator may be written in the form 


where Af ,(t) may be treated as a small perturbation, and where it is assumed 
that we already know the eigenvalues E,, and eigenvectors |n) of Ao, satisfying 


Aio|n) = E,|n). (65) 


We then seek a solution of the Schr6dinger equation for given initial conditions 
as a series in powers of A. 

To do this, we expand the solution |, t> of the Schrédinger equation in 
terms of the eigenvectors |n), 


Wt) = > In) ¢,(¢). 
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Using (65) we may write the Schrédinger equation as 


dc, 
dt 


iY |ny —™ = YE,In) ¢,(1)+¥, A, |n> g(t). 


Then, taking the scalar product with an eigenstate |m> and using the ortho- 
normality conditions (11), we obtain 


ih 


d _ 
na ie ott Cm) +), <m|H ,(t)|n> c,(t). 


It is convenient to extract the time dependence of the unperturbed system by 
setting 


c,(t) = u,(t) exp (—iE,,t/h), 


so that the equation becomes 
. du, ry . 
ih SP =D <mlll (0) In) exp (iy) 9(0) (66) 


where o,,, = (E,,—E,)/h. 

Now let us suppose that initially, at t = 0, the system is in one of its eigen- 
states, |W, O> = |), say. Equivalently, w,,(0) = 6,,,. We take this as our zeroth 
order solution for all ¢, 


Up!)(t) = Son 
Then the complete solution may be expressed as a series 


Um(t) =D) Um (t), 


r 


where higher terms are obtained by iteration from (66). Explicitly, u is ob- 
tained by solving the equation with u replaced by u“~?? on the right, which 
yields 


t 
un~?(t) = > y far <m| Ai, (t’) |n> exp (i2yny t’) U6 (1). (67) 
0 
We shall consider explicitly only the first-order solution 
t 
p(t) = — | dt’ <m|FT,(t") n> exp (ign t’). (68) 
0 


To be specific, let us suppose that A, is periodic in time, with the form 


A,(t) = V exp (—iwt)+ V* exp (iat). (69) 
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Then we can perform the integrals and obtain 


ante, , _ 2 i= . 
in (8) = Con] ny = EEO ONT e (apy = Pa 01 


(70) 


Because of the presence of the denominators, the transition probability from 
state |n> to state |m> can be large only if @,,, ~ --@. Let us suppose that 
mn  @, SO that the second term in (70) is negligible. Then the transition 
probability per unit time to state |) is, in the limit of long time, 


Winn = lim (1/t) |Uya?(#)|? 
t~ co 


. 1 jl—exp[i(a,,,—@)t]|? 
= ||P jn)? lim — | PPE Om = OP! 
i<m|V |n>| um Ao. a) 
Now clearly if ,,,,4@, then the limit is zero, because the squared modulus is 
always finite. On the other hand, for o,,,, = , the quantity inside the modulus 
sign depends linearly on ¢t, so that the limit is infinite. The limit therefore 
behaves like a Dirac delta function, 6(@,,,,—@). We note that 


h( Onn ~ RP’ 


+a 
f deo ee 2 J« sites 2n 
= = 55 
independent of t. Hence the integrand must tend to the limit (27/h?)5(@an — @). 
Thus we obtain for the transition probability per unit time the expression 


Vina = Ga KP? 5 (gg 0) = = [drm Plnd/? 5(Eg—Eq—ho). (71) 
If we have a strictly periodic perturbation, and discrete energy levels, we can 
never get a finite transition probability. To obtain a finite result we have either 
to integrate over a range of values of the final-state energy, or over a range of 
values of the perturbing potential. 
We can calculate similarly the transition probability for a downward 
transition, using the second term of (70). The result is 


2 Pa 
Won = Sz KKM] P*|2>1? 5(Oyy +) = |X| P*|n>|? 5(Eq—Eq +h). (72) 


mn h? 


Comparing these two expressions, (71) and (72), we see that W,,, = Wam always. 
This is an expression of the principle of detailed balance. 
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3.2 SEMICLASSICAL TREATMENT OF INDUCED TRANSITIONS 


In semiclassical theory, we treat the electrons quantum mechanically, but 
regard the electromagnetic field as an externally prescribed classical field. The 
Hamiltonian operator for an electron in an atom, and in the presence of an 
external radiation field is obtained from (60) by replacing r and p by operators 
f and p. It is 


A= eas [p—eA(f, t)]? + V (8), (73) 
2m 


where V(f) is the (Coulomb) potential responsible for atomic binding. We 
may write this Hamiltonian in the form (64) with 

Ay =— P+V() 

2m 
and ‘ 22 
A(t) = -——[p. A(#,1)+A(8, t). §] + —— ACE, 2)? 
2m 2m 
The difference between the two terms in the square brackets is 
p.A-A.p=-ihV.A=0 


in the radiation gauge. Moreover, if we work only to first order in e, then the 
last term in H, is negligible, and we may take 


AY) = -—p A(t, 1). (74) 


Let us suppose that the external field is a plane wave, of the form (50). Then 
the perturbation (74) is periodic in t, and can be expressed in the form (69), 
with @ = @, and 


0 = ~— §.A, exp (ik.?). 
2m 


Thus according to (71) the transition probability per unit time for an induced 
absorption process is 


2n e 
W = ane 


2 
mn Fe Ao . <m|p exp (ik . f)|n> 5(Omn— Dy) 


2m 


For optical frequencies, the wavelength of the electromagnetic wave is long 
compared with the atomic dimensions. Thus it is a good approximation to 
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replace the factor exp(ik.r) in the matrix element by 1. This is the dipole 
approximation. Using the relation 


[f, Hy] =—Pp, 
Mm 


we can obtain a relation between the matrix elements of p and of f, namely 
ih — 2 ‘e 
—mlbla> = CnlLF, Holla = (En— En) mt pny = hp m|f|ny. 


Hence we find 


a ee eee 
Wmn = a On Ao™ Aola.Cm|t|n>|"S(mn — Ox), (75) 
where a is the complex unit vector in the direction of Ap, a=Ao/(Ap*.Ag)?- 
As expected, the transition probability is proportional to the intensity J given 
by (59). 

If we suppose that the field is an incoherent mixture of components with 
different frequencies, then we have to sum over all frequencies. If the intensity 
in the frequency range between w and w+dq is I(w)dw, then the total 
transition probability per unit time is 


Wn = (27)? = I(@qy)|a<mnl n>, (76) 


where « is the fine structure constant, a dimensionless number, 


~—. (77) 


This discussion covers only induced transitions, not spontaneous ones, 
which can only be treated correctly in the fully quantized theory. 


4. QUANTIZATION OF THE ELECTROMAGNETIC FIELD 


4.1 MODE DECOMPOSITION OF THE ELECTROMAGNETIC FIELD 


In order to quantize the electromagnetic field, it is convenient first to decom- 
pose it into modes. 

Let us begin by considering the field enclosed in a cavity of volume, V, and 
satisfying prescribed boundary conditions on the walls. For example, we may 
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choose the walls to be perfectly conducting, which requires that the electric 
field must be normal to the walls, 


nAE=0 (78) 


on the boundaries, where n is the unit normal. 
In the cavity (where for the moment we assume there are no charges or 
currents), the radiation-gauge vector potential satisfies the wave equation 


07A 
Eo Pury = jig? V*A = 0, (79) 
ot 
together with the appropriate boundary conditions, for example 
nAA = 0. (80) 


We now look for periodic solutions of (79). To do this we have to find all 
values of k* for which the equations 


(V7 +k) f, = 0, V.f, =0 (81) 
have solutions satisfying the boundary conditions, say 


If the subscript & labels the set of all such solutions, then we can expand the 
general solution of (79) in terms of these functions, in the form 


AG, t) = pe g(t) f(r), (83) 
where q,(t) satisfies the equation 
d2 

qt tO d= 0, (84) 


with wo," = ek*. 
The corresponding magnetic field, given by (45), is 


B(r, t) = »} a(t) V A f(r) (85) 


while the electric field may be written 


E(r,t) = — 2 a(t) f(r), (86) 


where p, = dq,/dt. With this definition, we may write (84) in the form 


ae = Pr ae = — OF "Oy: (87) 


32 T. W. B. KIBBLE 


These are the equations, in Hamiltonian form, of a set of harmonic oscillators 
of unit mass and angular frequency w,. Thus the electromagnetic field has been 
decomposed into modes, each behaving like a harmonic oscillator. 

It may be anticipated that, with suitable normalization of the mode func- 
tions f,(r), the Hamiltonian (56) of the electromagnetic field can be written as 
a sum of harmonic oscillator Hamiltonians. To show that this is indeed the 
case, we have first to establish certain orthogonality properties of the mode 
functions. 

Using the vector identity (52) with a = f, and b = Vaf,, we have 


V.[f,A(VAE,)] = (Vat). (VAL) — £,. [VA(V Af]. (88) 


We now integrate over the volume V, and use Green’s theorem to express the 
left hand side as a surface integral over the boundary S of V, 


[as fA WV Af, = [as Af, .(V Af,), 
s s 


which vanishes in virtue of the boundary condition (82). On the other hand, 
by (81), 
VA(VAft,) = VV .f, a V7 f, => k7f,. 


Hence we obtain 


| dr (VAf,). (VAL) = k? i d3r f,.f, = 2 | d°r f,. f,, (89) 


Vv v ¥ 


where the last equality follows by symmetry. Thus the integrals vanish unless 
k? = ]*, We shall see in a moment that the correct normalization to choose 
is 


eo | d°r f,,. f, = On. (90) 
4 
For then the first term in the Hamiltonian (56) becomes 
48 [ore =4 x VPP Zi) [ors f,=4 ) pe’, 
fi k Lt . k 
while the second may be written, using (89) and (90) ,as 


tuo" | d*rB? = 4 py dM om? { dr (V Af,) .(V Af) = 4 > Oy,” 


V 
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Thus the total Hamiltonian of the electromagnetic field is 
Hy = d 4p? +047q,7), (91) 


in agreement with (87). 


4,2 FIELD QUANTIZATION 


Now that we have reduced the Hamiltonian for the electromagnetic field to a 
sum of harmonic-oscillator Hamiltonians, the quantization of the system is 
straightforward. We have to replace the classical variables g, and p, by opera- 
tors 4, and p, satisfying the canonical commutation relations (9). We choose 
the Hamiltonian of each oscillator in the form (28) with the zero-point energy 
removed, and therefore obtain for the total Hamiltonian the expression 


Ay = x 4(p,? + 0,7 4,’ —ha,) = py hw, a,* ay, (92) 


where as before we have defined the non-Hermitian operator 
= On4y + ify, 
k (Qha,)? 
It may be noted that the removal of the zero-point energy is in this case 


essential if we want to have finite energies, because the sum over all modes of 
the zero-point energy, 


(93) 


> $ho,, 
k 


is infinite. The only observable energies are of course energy differences be- 
tween states, and these are finite. The Hamiltonian (92) is chosen so that its 
lowest eigenvalue, the energy of the ground state or vacuum state, is zero. 

The ladder operators (93) satisfy the commutation relations 


[4,, 4;] = [4,*,a,*] = 0, — [@y, )* ] = du. (94) 
The ground state |0> is characterized by the condition that 
a,/0> = 0 (95) 


for all kK. Other energy eigenstates may be obtained by acting with the opera- 
tors 4,* successively. For example, 


|k> = a,*|0>. (96) 


is an eigenstate of energy hw,, which may be interpreted as a state containing 
one photon in the kth mode, and 


Ik, [> = a,* @,*|0> (97) 
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is an eigenstate of energy h(w,+.@,), a two-photon state with one photon in 
each of the two modes labelled by & and /. (If k = /, it is a state with two 
photons in the kth mode. In that case, it should have an additional normaliza- 
tion factor 1/2*.) 

In general, each energy eigenstate may be labelled by a set of numbers 
Ny, My, ..., one for each mode. The number 7, represents the energy level of the 
kth mode, or equivalently the number of photons in that mode. Each n, may 
range over the values 0, 1, 2, ..., but only a finite number of them may be 
different from zero. Denoting the set of all the numbers 7, by {1}, we may write 
this eigenvector as 


[{n}> = |, May woey Mg wae ye 
The corresponding energy eigenvalue is 
Ew => > Ee = » ny, ho,. (98) 


This state is of course an eigenstate of each of the operators N, = 4,* 4,, with 
eigenvalue n,, and also of the total photon number operator 


Ves: 


with eigenvalue 


a= Y Ny. 
kK 
The effect of any one of the operators 4, or 4,* on a state |{n}) is to change 
the value of the label 1, by one unit, while leaving all the other labels unaltered: 


yl {2} > = (MP Wa, Nay ey Mas Ds 


(99) 
a,*\{nt> = (m.t+1)*| ny, no, -..5m+1, ...> 


Their effect is to remove one photon, or to add one extra photon, and they are 
therefore called annihilation and creation operators respectively. 

Just as for a single oscillator, we can again define coherent states. Each 
coherent state is labelled by a set of eoniplex numbers {a} = {a,, %,...}. It is 
defined by 


i{a}> = x led Wl Gor a > exp(— Fal”). (100) 


M1, 2, - oF 


Using (99), it is easy to prove, as before, that these states are eigenstates of 
each of the annihilation operators a,. They satisfy 


A,l{a}> = %|{a}>, (101) 
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and hence also 


C{a}] Ay* = 04,* C {a}. (102) 


It follows from these relations that 


<{ao} Mul {oe} > = < {ax} ay @yl{a}> = [ay]. 


Thus, in order to ensure that the expectation value of the total photon number 
is finite, we must impose on the set of numbers «, the condition 


C{a} INI {a}> = 2 lal? < 0. (103) 


(This is also necessary to ensure that the product of the exponential factors in 
(100) is convergent.) 
The scalar product of two coherent states is easily evaluated, and is given by 


<{a}I{B}> = exp (x (y* fs ¥1eu!?—41641). (104) 


With the restriction (103), this sum is always convergent. 

It is also easy to prove, as before, that a state which is initially a coherent 
state always remains coherent. In fact, if |W,0> = |{«}), then |, ¢> = |{a(r)}), 
where 

a(t) = a, exp (—ia, t). (105) 


Each a(t) follows the classical circular path, described with the appropriate 
constant angular velocity — @,. 

Using the eigenvalue equations (101) and (102), we can evaluate the expec- 
tation value of any normally-ordered product of the operators 4, and 4,*. For 
example, using the relations (85) and (86) connecting the electromagnetic 
fields to the generalized coordinates and momenta, we easily obtain 


2h \# . 
BED» = {o(9}1 BOM(AC}Y = FE (—~) Vase) Re fapexp (— io], 


(106) 
and 
CE(r,1)> = Cfo} EM {a@}> = » — (2he,)* f,(r) Im [o, exp (— ia, 1). ' 
(107 


These expectation values are of course solutions of Maxwell’s equations. 
Correspondingly, for any solution of Maxwell’s equation satisfying the chosen 
boundary conditions, we can find a coherent state in which the expectation 
values of the field operators are equal to the fields of this classical solution. 
The coherent state is the quantum state which best approximates the given 
classical field. 
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4.3. INTERACTION WITH A CLASSICAL CURRENT DISTRIBUTION 


We can also show that the coherent states are the states produced by inter- 
action with a classical current distribution, which plays the role of the forcing 
term in the harmonic oscillator considered earlier. 

Let us suppose that within the volume under consideration there is an ex- 
ternally prescribed current distribution, j(r, t), which we assume to vanish on 
the boundary. First, to find the appropriate Hamiltonian, we consider the 
purely classical problem. In the presence of this current distribution, the 
radiation-gauge vector potential satisfies eqn (49), 

2 
coat — Uo VA = j — eve = Jr. (108) 
Actually, the appropriate definition of j, in this case is slightly different, 
because instead of requiring ¢ to vanish asymptotically as roo (the condition 
which led to (48)), we should now require it to vanish on the boundary of our 
volume V. It is still true, however, that j; is the divergenceless part of j. 

Now, if we insert the mode decomposition (83) for A, then we may isolate 
the term corresponding to a single mode by taking the scalar product with 
the mode function f, and integrating over V. Using the orthonormality re- 
lation (90), we obtain 


d? ; 
GE tO Ie = Je (109) 
where 
h(t) = | x 4) 50,0). (110) 
Vv 


We have dropped the subscript T on j in (110), because the longitudinal part 
of j actually does not contribute. By using Green’s theorem its contribution 
may be transformed into the sum of an integral involving V.f, = 0 and a 
surface integral which vanishes because of the assumed boundary condition 
for ¢. 

The equation of motion (109) is that of a forced harmonic oscillator. It may 
be derived from the Hamiltonian function H = H,+H,(t), where 


H,(t)= — 2 At); (113) 
which may also be written in the form 
Ho =- fax ret (112) 
V 


using (110) and (83). 
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We can now immediately write down the corresponding Hamiltonian for 
the quantized electromagnetic field interacting with a classical current. It is 


A = A,+A,(t), where 
Ax) = — Fat) = =| d°r Aj. (113) 


Since this Hamiltonian is a sum of terms, each containing the variables cor- 
responding to only a single mode, it affects each mode separately. Our pre- 
vious analysis of the forced oscillator applies without modification to each 
individual mode. In particular, let us suppose as before that the external 
current distribution j(r,¢) is zero except during some finite time interval, 
to<t<t,, and that for t<f, the system is in its ground state, |0>, or in other 
words that no photons are present before the current is switched on. Then at 
any time t>tf,, each mode will be in a coherent state, given by (38). Thus the 
system as a whole will be in a coherent state, 


ly, t> = |{a(t)}> exp (id), (114) 


where a(t) = a, exp (— ia, t), with 
"1 rx ) 
2, I ‘ : — Yr 
oh, = Ghop)® fora exp (ia, t) Oho»? (115) 
to 


as in (39), and 
do Lie 


on Op —@* 


== 


LP (116) 


+ 00 
19) 
as in (40). 

It is easy to verify, using (106) and (107), that the expectation values of the 
fields in the state (114) are precisely the classical fields one would obtain by 
solving Maxwell’s equations in the presence of the prescribed current distri- 
bution, with the given boundary conditions, and the initial condition that the 
fields vanish for t < fp. 


4.4 WAVE VECTOR REPRESENTATION 


So far we have been considering an electromagnetic field confined in a cavity 
of finite volume V. However, we often want to consider a field which is not so 
confined. We can do this by considering the limit of infinite volume, but in so 
doing it is more convenient to start with a different set of boundary conditions. 
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Let us choose a cubic volume V = L*, and impose periodic boundary con- 
ditions, 


O(X+L, y,z) = H(X, y+L,z) = O(x, y, Z+L) = O(, Y,2) (117) 


on each of the field components. 

Now, since the differential equations (81) and the boundary conditions (82) 
satisfied by our mode functions were real, we could always choose these func- 
tions f, to be real also. (The real and imaginary parts of any complex solution 
are themselves solutions.) The same is true with the boundary conditions 
(117). However, it is more convenient to choose the mode functions to repre- 
sent plane waves. Each mode is labelled by a wave vector k and a polarization 
index s = 1, 2. To satisfy the periodicity conditions (117), we must restrict the 
allowable wave vectors by the conditions 


k,L=2nn, k,L=2nn,  k,L = 2nn,, (118) 


wher n,, ny, n, are integers. The mode functions are chosen to be 


fy,5 (1) = e,(k) exp (ik .r), (119) 


1 
(Eo v)? 
where e, (k) and e,(k) are mutually orthogonal unit vectors perpendicular to 
k. These functions are normalized so that 


eo | dr fr <° fi, y= Ox, k’ 6. s’* (120) 
V 


The use of complex mode functions necessitates some changes in our earlier 
formulae. In particular, in expansions like (83), (85), or (86), of real fields, the 
coefficients cannot any longer be taken to be real, and therefore in the quan- 
tum mechanical case cannot be identified with the Hermitian operators 9, or 
p,. Instead, we may separate the terms involving annihilation and creation 
operators, and write 


A(®) = OP). (ae ct ar. fe 
© =F (sr) (h.O+40, Ol, (121) 
and 
BO = F(A) 1a. 6..0-af 01, (122) 
k,s 


It is clear that these expansions reduce to the earlier ones when the mode func- 
tions are real. 
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As before, the annihilation and creation operators satisfy commutation 
relations of the form 


[ay Ss? ay, sl = [ax s? ay. sl = 0, (4, ss at, s] = Ox, k’ 6s, s‘° (123) 


From these it is a matter of straightforward algebra to compute the commuta- 
tors of the field operators. The commutator of any two field variables is given 
by a double sum over k, s and k’, s’, which is reduced to a single sum when we 
insert the relations (123). To perform the sum over s we use the fact that e, (k) 
and e,(k) span the plane normal to k, so that 


kk 
k2 


y e5,i(K) é,, (Ik) = 54, — = 5". (124) 
s=1 


(The right hand side is the projector on this plane.) The sum over k is of the 
form 


¥; exp [ik . (r—r’)] = V6,(r—r’), 
k 


where 6(r) = 6(x)6(y)6(z) is the three-dimensional Dirac delta function, or is 
obtained from this relation by differentiation. When we go through the alge- 
bra, we find that any two components of Ff (or any two of B) commute, as do, 
for example, £, and B,. A typical non-vanishing commutator is 


8.0); 8,1 = 9,9: 
& Oz 


We learn therefore that although it is possible in principle to measure with 
arbitrary accuracy parallel components of E and B, this is not the case for 
perpendicular components. 

We are now in a position to consider the infinite-volume limit. We note that 
according to (118), the allowable wave vectors are distributed uniformly in k 
space, forming a cubic lattice with spacing 27/L. The volume of k space per 
point is thus (2z/L)*? = (27)*/V. Thus in the limit as Loo, the spacing be- 
comes arbitrarily small, and the sum over k goes over to an integral, 


(2 


a ro | d3k. (125) 


Correspondingly, in this limit a Krénecker delta symbol goes over into a 
Dirac delta function, 


V 
Gar daw > 5s(k-k). (126) 
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It is convenient to absorb a factor of V* into the definition of the annihilation 
and creation operators. We set 


a,(k) = V7, ,. (127) 


Then the commutation relation (123) becomes 
[a@,(k), a,* (k’)] = Vox, k’ bs, s' (2z)° 63(k —k’) bs, s’s (128) 
by (126). Moreover, the decomposition (121) may be written, using (119), as 


A(r) = — 7D (a io) [@,(k) exp (ik . r) + @,*(k) exp (—ik . r)] e,(k) 


[S53 (ah \ [4,(k) exp (ik x) +4,*(k) exp (—ik..1)] (0). 


(129) 


by (125). This then is the correct mode decomposition for the infinite-volume 
case. 

It should be remarked that at least two other normalization conventions are 
in common use. Many authors introduce an extra factor (27)~>/? in (127), 
thereby eliminating the (27)* from (128), and replacing it in (129) by (22)9/?. 
On the other hand, particle physicists who are interested in making formulae 
obviously invariant under relativistic Lorentz transformations tend to include 
an extra factor of (2w,)?. 

All our previous formulae can be carried over with appropriate modifica- 
tions to the infinite-volume case. Since the number of modes is no longer 
countable, it is impossible to label the states by the occupation numbers of the 
modes. However, we can still use the alternative labelling introduced in (96) 
and (97). We may define the one-photon state 


Ik, s> = &,*(k)|0>. (130) 
This state has a delta-function normalization, 
<k, s|k’, s’> = (22)? 63(k~k’) 6, y. 1 (131) 


An arbitrary one-photon state is a linear combination of states of this form, 


d*k 
W=E | Gor berW.tb. 


It is normalized if 


Slay > We? = 
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Similarly, the basic two-photon states are 
Ik,, 5,3 K2, 82> = &,,*(k,)4,,*(k,) |0>. (132) 
Their orthonormality relations are 
Ky, 513 Ko, Salky’, 51’5 ky’, 52) = (ky, 5y[Ky’, 51> (Kp, 52| Ko’, 52’> 
+ Ck, 51 [k 2’, 52’) (Ka, S[k’, 51/>. 
The basic states (130) and (132) are eigenstates not only of the Hamiltonian 


Ay = iF: - a ho, a,*(k) a,(k) (133) 


and the total photon number operator 


but also of any other linear function of the operators 4,*(k)@,(k). In particular, 
we may define the total momentum operator, 


d3k 
Pp = | Sk a,*(k) a,(k). 1 
D | Ge tk at) 4.09) (135) 
Its eigenvalues in the states (130) and (132) are hk and h(k, +k,), respectively. 
A coherent state must now be labelled not by a sequence of complex num- 


bers, but by a complex function «,(k), defined so that 


A(k)|{a}> = o,(k)| {a}. (137) 


To see how to expand it in terms of the states labelled by k, s, we note first 
that for a single mode, 


a*"0> = (n!)*\n), 
and hence that (100) may be written 


{a} = exp (¥ayae") [0) exp (—4 5 lal’). 
Hence, going to the infinite-volume limit, we have 


I{a}> = exp @ { = a,(k) a."(k) 10> exp (-3 > { 7 06)). 
(138) 
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Expansion of the first exponential would yield an expansion in terms of n- 
photon states. We note also that the scalar product of two coherent states is 
now 


- 3 
<(eH1{B}) = exp (Ef 5 [a_"(R) Bx(K) —Hay(W)1?—416,00)171) (39) 


5. QUANTUM ELECTRODYNAMICS 


We now turn finally to the fully quantized theory, in which the system to be 
studied includes both the electrons and the electromagnetic field. The Hamil- 
tonian function for the corresponding classical system includes both the field 
energy (56), and the Hamiltonian function (60) for each electron. However, 
as we remarked earlier, we must be careful to avoid counting twice the Cou- 
lomb energy which is present in both. If we label the electrons by an index j, 
then the classical Hamiltonian may be written 


H= | d?r 4(e) E? + 7! B?)+ > [slPs—eAe P+ V(r) » (140) 


where V,(r;) is an external Coulomb potential, representing the effect of the 
atomic nuclei for the case of bound electrons. The Coulomb energy of the 
electrons themselves is contained in the first term, as the contribution of the 
longitudinal part of E. 

It is convenient to write the corresponding quantum mechanical Hamilton- 
ian as a sum of terms in which the free parts (independent of e) are separated 
from the interaction parts. We write 


A = (Aort+ Ac) + ¥ (Aoj+Ay;+A2). (141) 


Here Ag,, the free Hamiltonian of the electromagnetic field is the first term of 
(140) with the Coulomb energy removed, namely 


ener | d°r (¢9 By? + uy”! B) = ¥ ha, a,* a, (142) 
k 


In the present case, we are dealing with point particles, and thus the charge 
density is a sum of delta functions, 
p(r) = bz ed(r—r,). 
J 
Thus the Coulomb energy may be written 
2 
e 


Ae = /—_. (143) 
i 3 ms 4néo|r;—r;-| 
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Any term with j = j’ in this sum is clearly infinite. It represents the infinite 
Coulomb self-energy of a classical point particle. However, such a term is 
merely a constant addition to the Hamiltonian, which it is legitimate to ex- 
clude. The prime on the summation sign indicates that all terms with j = /’ 
are excluded. (There are in fact other contributions to the self-energy, cor- 
responding to processes in which an electron emits and then reabsorbs a 
photon, which must also be subtracted, but we shall not be concerned with 
these.) 

The free Hamiltonian for the jth electron (including the Coulomb potential 
of the nucleus if it is bound) is 


1 
Ay; = Fm Bi + Vilk)- (144) 
The interaction parts are 
e ~ 
Ay; = — = bj) - A) (145) 
and e 
A., = oy A(f;)’, (146) 


where A(f ') is given in terms of photon annihilation and creation operators by 
(83) or (121), namely 


x 4 
Ae) =F (5-) Cane +arKee (147) 


We can now proceed to treat this Hamiltonian (142) by perturbation theory, 
taking as our free Hamiltonian 


Ay = Age + ¥, Ay;. (148) 


The eigenstates of My may be labelled by the occupation numbers n, for each 
mode of the electromagnetic field, and by the quantum numbers n,; which 
label the energy levels of each electron. The various interaction terms will 
induce transitions between these eigenstates of Ay. The term A, , will induce 
transitions in which the state of the jth electron changes, while one photon is 
absorbed or emitted, while for H,, ; two photons must be absorbed or emitted 
(or one absorbed and one emitted). The Coulomb term A’, will induce direct 
transitions involving two different electrons, with no accompanying change in 
the photon occupation numbers. 
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5.1 SPONTANEOUS EMISSION 


As a first example, let us consider a system containing only a single electron, 
bound in an atom, and suppose that initially it is an eigenstate |”) of Ay, with 
no photons present. 

To first order in e, we need consider only the interaction term A,, not A, 
(n or H-, which of course does not appear for the single-electron case). The 
lowest-order transition probability to a final state | > is given by the analogue 
of (71) for a perturbation which contains no explicit time-dependence, 
namely 


2 
Wyn = Kf IAL In>!? 5(Ey Ey). (149) 


Now, since the intial state contains no photons, the term in A , containing 
a photon annihilation operator cannot contribute. We need only consider the 
second term in (147). The only final states for which there is a non-vanishing 
transition probability in lowest order are those containing a single photon, say 
in the kth mode, and with the electron in an eigenstate |m), that is, states of 
the same form | f> = |m;k)>. From (145) and (147), the corresponding matrix 
element is 


h 4 
ms kiAsIny = — = (5) Cons kb fe Dan. 


24 
m K! 


The operator appearing in the matrix element is a product of an operator 
involving the variables describing the state of the electron, f and f, and a 
photon creation operator, and the matrix element is therefore a product, 


Cn; Kp fy *(B) ay*|ny = Cmip .fe*(8) [n> <kldy* [0 
= <mlp f°) In) Sy. 


Thus the only non-zero term in the sum over k’ is that with k’ = k, and we 
have 


é 


h \4 
dom: KIA In = — (=) * cop .f.408 fd. 


In particular, if we use plane-wave mode functions (119) and assume, as in 
our earlier semiclassical calculation, that the dipole approximation is valid, 
then we obtain 
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<m;k, s|ff,|n) = — = : )’ ——. <mipin)-e,*(k) 


m 20, (e9V)* 
= —i ( A ; <mif *(k) 150) 
= — 10D soar) m|f|n> . e,*(k). ( 


What we are interested in physically is the probability of emission of a 
photon in a given direction within some angular range defined by a solid 
angle dQ. To find this probability, we have to sum over a small range of 
values of the wave vector k, using the relation (125) in the form 


ere vk __! [4 dk do = : [ 7 den, d0 (151) 
V¢ (2n)> (2x) nee OR oe 


Inserting (150) into (149), and performing this summation, we obtain for the 
probability per unit time of a transition from |n) to |m), with the emission of a 
photon in the angular range dQ, the expression 


Wald) = 27 YY Kosh, 1! 5m + 0) 


h 


OOD? smn 
2nc? 


dQ )) KXmltin> .e,*(k)/?, (152) 


using the definition (77) of «. The sum over polarizations may be performed 
using the formula (124). 

If we are interested only in the total transition probability from |n) to |m), 
then we have to integrate over all directions. Since 


k;k, 8x 
faa(o, - Pe) Foe 


the total transition probability per unit time is 


4 a0? nn 
W.. = a a |<m|f|n>|?. (153) 


mn 


5.2 INDUCED EMISSION AND ABSORPTION 


It is actually unnecessary to recalculate the transition probabilities for induced 
emission and absorption processes, because we can make use of a general 
theorem which allows us to reduce the calculation (at least in lowest order) 
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to the earlier semiclassical one. However, we shall also consider later a differ- 
ent, independent calculation, to show that the results are indeed the same. 

Let us suppose that the initial state contains a single electron, in the eigen- 
state |n> of Ay, together with a coherent state of the electromagnetic field. 
This choice is motivated by the observation that a coherent state is the quan- 
tum state which best approximates a classical field. Now, the measurement 
of the transition probability for some atomic transition may be regarded as a 
measurement of some property of the state of the electromagnetic field. In 
other words, we are at liberty to regard our atomic system as a piece of measur- 
ing apparatus with which we can measure some observable, which must be 
some function of the photon annihilation and creation operators 4, and 4,*. 
It is not necessary for our purposes to know exactly what function this is. We 
now apply the multi-mode generalization of the result (32), namely 


{OIF (Ax, Oe*)1 {a} > = COLF (A, + a, A,* +.0%4,*) |0). (154) 


This shows that the expectation value in question is the same as the cor- 
responding vacuum expectation value of an operator obtained from F by 
adding c-number terms to 4, and 4,*. On the other hand, a classical external 
electro-magnetic field would also be represented by just such extra c-number 
terms, added to these operators. Hence we may conclude that the probability 
of some atomic transition in the presence of an electromagnetic field which is 
initially in a coherent state |{a}> is equal to the probability of the same tran- 
sition in the presence of a classical external field, but with no photons 
present in the initial quantum state-vector. In lowest order this gives just the 
semiclassical result, although there will of course be higher-order correction 
terms corresponding to processes in which photons are emitted and perhaps 
reabsorbed by the atomic system. 

To verify this theorem in a special case, let us compute the probability of 
absorption of a photon. We begin by assuming that the intial state contains 
N photons, all in a single mode k. Since we shall be considering only this 
mode, we drop the mode label k for the moment, and denote the state simply 
by |n; N>. 

To first order in e, we again need to consider only the interaction term A,. 
Because it is no more than linear in the photon annihilation and creation 
operators, it can have non-vanishing matrix elements only between states 
whose photon numbers differ by + 1. These are 


h \4 
(m;N—1\Ay\n;N> = ——— (=) N# Cmlpp .£,(8) |n) (155) 


and 
e 


h \+ 
<m;N-+1|A,\n;N> = —— (=~) (N +1)? (mp .£,*(®) In). 
Kk 
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This matrix element must be substituted into (149) to give the transition 
probability per unit time. The argument of the delta function, the difference 
in energy between the final and initial states, is E,,-hw,—E,. Thus for an 
absorption process we obtain the result 


2x fe \2h is 2 
ete =7(<) a5, N KemlB £191? 5m — 04) 


or, in the dipole approximation 
aNa,c 
Wan = (2)? —*— |inlé 2) . €,(k) |? 5 (oq, — 2). (156) 


It is easily seen that it makes no real difference if we replace the state with 
definite photon number by a coherent state. A coherent state is a linear super- 
position of states with different values of N, but since each N leads to a final 
state with a different photon number, their contributions to the transition 
probability add incoherently. The net effect is simply to replace N in (156) by 
the expectation value of N in the coherent state. Since the intensity in an 
N-photon state is the photon number density N/V multiplied by the energy 
of each photon ha,, and by the velocity c, I = Nha,c/V, the result (156) 
agrees precisely with the semiclassical result (75). 

We can also consider initial states which contain photons in several differ- 
ent modes. If we choose a state containing n, photons in the kth mode, then 
we see that the absorption of a photon from one mode leads to a state ortho- 
gonal to that obtained by absorption from a different mode. Thus, again, 
these contributions to the transition probability add incoherently. The result- 
ing expression, obtained by summing over a range of values of k agrees with 
the semiclassical result (76). 


5.3. SCATTERING PROBLEMS 


As an example of a rather different kind, let us consider the scattering of 
radiation by an electron in an atom. We suppose that the electron is initially 
in its ground state, |g>, say, and that there is a single photon present in the kth 
mode. Thus the initial state may be denoted by |g; k>. We seek to calculate 
the probability of a transition to a final state in which there is again a single 
photon present, but in a different mode, and in which the electron is left ina 
state |m> (which may once more be the ground state, or may be one of the 
excited states, or indeed an unbound state, if the photon is sufficiently ener- 
getic to ionize the atom). The final state in which we are interested is therefore 
Im; k’). 

Now, since Af, is linear in the photon annihilation and creation operators, 
it cannot in first order induce transitions in which the photon numbers of 


48 T. W. B. KIBBLE 


two different modes change. Thus there is no matrix element to order e, and we 
have to go to order e? to obtain a non-vanishing contribution. In that order, 
there are two distinct contributions. In first-order perturbation theory, there 
is a contribution from the term A’, in the Hamiltonian, which is of order e?, 
and, moreover, Af, itself can contribute in the second-order of perturbation 
theory. 

Let us consider first the effect of A. Since we have to remove a photon from 
the kth mode, and create a photon in the k’th, the relevant term is the one 
proportional to 4,.*4,. Identifying the coefficient of this term from (146) and 
(147), we find that its contribution is 


Ale:k) = £ —* * 
<m;k\Aalesk> = > sear (mille *®) Og, (157) 


or, if we specialize to the case of plane-wave modes, 


2h 
<m;k’, s'|Aa|g;k, 8) = oF (ml exp Li(k—k’).Fllg> ey*(K), e,(4). 


2ME_ V(@, W,: ne 


If the frequency of the photon is sufficiently low to make the dipole approxi- 
mation valid, then we obtain a non-vanishing matrix element only in the 
special case m = g, so that the scattering is necessarily elastic. It is clear more- 
over, that in this case the matrix element would be the same for any initial 
state. This is what we should expect physically. In the long wavelength limit, 
the details of the atomic structure are irrelevant, and the photon scatters as 
if from a free electron. 

Let us for the moment ignore the contribution of A, and suppose that (158) 
represents the complete matrix element. Then we may substitute it into (149) 
to find the transition probability per unit time. The argument of the energy- 
conserving delta function is of course E,, +ha,y — E,—ha, = h(@ng + Oy — Oy). 
To find the probability that the final-state photon emerges in a direction within 
the solid angle dQ, we may sum over a range of values of k’, as in (151). Now, 
the flux of photons in the initial state is the photon number density, 1/V, 
multiplied by the velocity c, say f = c/V. Dividing the transition probability 
per unit time by this flux, we obtain the differential cross section do for this 
process. It is given explicitly by 


V V 27 | e*h 
do = — —— | a, 7 da, dQ = | ————_--(m (k—k’).f 


x ey *(k’) .e,(k)) 5(0p,.+ cy —@ ) 
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or 


ke 


2 
dong = Fo" | ml exp [ilk—K’). Fle) | le Q).e, 0017, (159) 


where we have introduced the classical electron radius 


an e? 


go = = 


mc 4m& mc?" en) 


In (159), the magnitude of k’ is fixed by the energy-conservation condition 
Dye = Wy— Ong: 

If the frequency of the incident photon is sufficiently low, then the dipole 
approximation will be applicable. Then the only non-vanishing cross section 
is that for elastic scattering, 


dogg = To dQ |e, *(k’) - e,(k) |?. (161) 


This is the classical Thomson scattering cross section, which can also be ob- 
tained by either classical or semiclassical methods. The total scattering cross 
section, obtained by integrating over all angles, and summing over 5’, is 


8x 
3 


bot. 


o¢= 

For higher frequencies, the matrix element in (159) must be evaluated 
explicitly. This formula can also be used to calculate the ionization cross 
section, by summing over all unbound states |m) of the electron, up to the 
maximum available energy. 

An exactly similar calculation can be performed for free electrons, leading 
to an expression identical in form to (159). In that case, we may label the 
electron states by the momentum p. If we choose the electron to be initially at 
rest (p = 0), and finally in the momentum eigenstate |p’), then we obtain a 
non-vanishing matrix element only when the momentum-conservation condi- 
tion p’+hk’ = hk is satisfied. When combined with the energy-conservation 
condition E,, +h, = ha,, this condition yields the formula for the frequency 
shift in Compton scattering. 

So far, we have ignored the contribution of A,. To evaluate it, we need 
second-order perturbation theory. By substituting the first-order solution 
um, given by (68), into (67), we obtain u,,2?. Then from this we may evaluate 
the second-order transition probability per unit time. We shall not go through 
the algebra in detail, because it involves certain additional complications 
which would take us too far afield, but merely quote the result. The required 
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modification of the first-order formula is obtained by replacing the matrix 
element <m|A7,|i> from an initial state |i> to a final state |m)> by 


<miftity— yy ea calf (162) 


where the sum is over the complete set of eigenstates |n) of Ay. 

In the present case, the first-order term is zero (except for the contribution 
of A, considered above). Since A, can only create or annihilate photons one 
at a time, one factor of A, must annihilate the initial-state photon, and the 
other must create the final-state photon. These processes may occur in either 
order. Thus the intermediate states which contribute to the sum are either 
states |n> with no photons, or states |n; k, k’> containing two photons. (Here 
in contrast to (162), labels the states of the atom alone.) Inserting the expres- 
sions for the matrix elements of A’,, we see that the term which must be added 
to the matrix element (157) of A, is 


e? h (mip .f,.*®|n> Calb £.@lg> 


~ m: 2(@, Wy )* n E,,—E,—ho, 


(mip .£,Pla> <alp . f,.*®lg> 
% E,—E,+ho, oe 


A particular term in the sum over n will yield a large contribution when the 
incident photon frequency is close to w,,. Each energy level gives rise to a 
resonance of this kind in the scattering. (Higher-order terms in the perturba- 
tion series have the effect of giving each level a finite width, thus removing the 
infinity in (163).) In general, the denominator of the second term in the brackets 
can never be small, so that this term is negligible. 

Terms like (163) should also be included in the case of a free electron. 
However, if the electron is initially at rest, their contribution is identically 
zero. This is because of the factor p in the matrix elements, and the fact that the 
initial state |g» is an eigenstate of fp with eigenvalue 0. Thus for a free electron 
initially at rest the cross section is given simply by (159). 


CONCLUDING REMARKS 


The theory we have been discussing is still quite far from being the full theory 
of quantum electrodynamics, because it involves several approximations 
which, though legitimate for optical frequencies, must be discarded when we 
come to consider higher frequencies. 

We have throughout treated the electrons as spinless, and ignored the 
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interaction between the electromagnetic field and the magnetic moment of 
the electron. This approximation is not essential, and could fairly easily be 
removed. We should then have to consider a system with an extra term, of the 
form (constant) x é.B, in the Hamiltonian. 

A much more important approximation is that we have treated the electrons 
non-relativistically. The transition to a relativistic theory involves a lot more 
than merely a change in the relationship between energy and momentum. It 
also requires that we allow for the possibility of pair creation and pair an- 
nihilation. A high-energy photon striking an atom may not merely scatter but 
produce one or more pairs, consisting of an electron and a positron. This pos- 
sibility also leads to the phenomenon of vacuum polarization, the creation and 
subsequent reannihilation of a virtual pair. Because of this effect, the observed 
charge of the electron, e, is not the same as the constant eg which appears in 
the Hamiltonian. It is modified because, even in a vacuum, an electron is in a 
polarizable medium. Perturbation theory applied to quantum electrodynamics 
tends to give infinite results, beyond the lowest order. However, it has been 
shown that all the infinities may be regarded as contributions to just two 
constants, relating the observed charge and mass of the electron, e and m, to 
the constants which appear in the Hamiltonian, eg and my. Expressed in terms 
of e and m, rather than eg and mp, observable quantities such as scattering 
probabilities are always finite. The predictions of the theory, in particular for 
the anomalous magnetic moment of the electron, and the Lamb shift in hydro- 
gen-like atoms, are among the best verified predictions of any existing physical 
theory. 

In practical terms, if we are dealing with radiation of optical frequencies 
(and even more, of course, for microwaves), then relativistic and spin cor- 
rections are generally small. They would have to be included if we wanted 
results of great accuracy, so that there would be little sense in going to high 
orders of perturbation theory without including them, but if we are satisfied 
with the leading order, there is no need to do so. It is also true that for many 
problems the quantization of the radiation field is unnecessary. As we have 
seen, processes of induced emission and absorption of radiation can be handled 
semiclassically, with the same results (in lowest order) as those provided by the 
full quantum mechanical treatment. Even the process of spontaneous emission 
can, in a sense, be reproduced in a semiclassical theory. We can take the 
expectation value of the electric current operator, and treat that as a source of 
the field, inserting it as an inhomogeneous term in Maxwell’s equations. Then 
we do not get precisely the results of the quantum theory, but something 
approximating to them. In this semiclassical theory, the excited states of an 
atom would, strictly speaking, be stable because the expectation value of ep/m 
is zero in any such state. However, if the initial state is not precisely an eigen- 
state of energy, but contains a small component of the ground state, then the 
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expectation value is non-zero, so that the system does radiate, and will decay 
eventually to its ground state, though not initially according to an exponential 
law. The fact that the emitted radiation obeys Planck’s famous relationship 
between energy and frequency is a consequence of the quantization of the 
atomic energy levels and does not, of itself, require the quantization of the 
radiation field. Field quantization is, however, needed to explain such things 
as photon-counting statistics. 


Quantum Theory of Coherence? 


R. J. GLAUBER 


1. INTRODUCTION 


Quantum electrodynamics is a fairly mature field these days; it is about forty- 
two years old. The curious thing is that although it has become an immensely 
accomplished and versatile study as it has developed from optics, there has 
been very little connection maintained between quantum electrodynamics and 
optics. As you know, the two studies overlap completely in principle, but 
until a few years ago the connection between them remained only rudimentary. 
We shall consider first the question of why that situation existed for so long. 
The much closer liaison which we have now between these two fields developed 
at approximately the same time as the laser appeared on the scene, and one of 
its by-products has been a particularly useful way of describing laser beams. 
The primary impetus to develop the theory, however, lay in a number of 
interesting questions about the behaviour of photons which dated from earlier 
years: we shall see below what those questions were. 

Perhaps the easiest way to obtain an overall picture of quantum electro- 
dynamics is to think of the electromagnetic spectrum. Just how far we say the 
spectrum extends toward its low-frequency end is more or less a matter of 
taste. For example, it could be maintained that we have seen fields oscillating 
at something like 10 Hz or thereabouts; on the other hand, at the violet end of 
the spectrum, the quanta become more and more energetic as we go through 
the regions of X-rays, and y-rays, etc. The y-ray spectrum is rather extensive 
and it is not unlikely that we have seen the effects of y-ray quanta whose 
frequencies are of the order of 1077 Hz. So we have something like 26 powers 
of 10 covering the spectrum with the visible band of the spectrum more or 
less in the middle as we observe it. The vast range of phenomena that 
radiations of these different frequencies produce, of course, recapitulates the 
entire history of physics. 

Probably the greatest of the accomplishments of present day physics lies in 
the fact that one single theory explains everything that occurs over the vast 
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range of the electromagnetic spectrum. Exactly the same things are happening, 
from a mathematical standpoint, in the radio frequency range and in the high- 
est frequency range of the y-rays produced only in cosmic radiation. It is all one 
theory; it has to be one theory because of the character of the Lorentz trans- 
formation, because we know that quanta of one frequency can be turned into 
quanta of any other frequency simply by observing them from a moving 
system. While that does not sound like a very practical thing for people to do 
(turning radio waves into y-rays, let us say), it is a very practical thing for 
electrons; electrons are easily accelerated to the appropriate velocities. In fact 
one way we have these days of creating a monochromatic y-ray beam with 
energies of hundreds of millions of volts is to let ordinary visible quanta 
collide with electron beams in a linear accelerator. That has been done at 
Stanford and it does provide a usable y-ray beam. The electrons thus “‘see”’ 
ordinary light quanta as energetic y-rays and in colliding with them transfer 
just a little momentum to them, but that little momentum looks like an awful 
lot in the laboratory frame, and, indeed, those visible quanta come out at very 
substantial energies. That ought to be evidence enough that the spectrum is all 
one unit, and it must be one theory which embraces the whole. 

The different physical qualities which characterize the different ends of the 
spectrum are completely at variance with one another. From a quantum 
mechanical standpoint, in the low frequency end of the spectrum we have 
fantastic quantum densities. If we asked, ““How many quanta are there, in, 
for example, a cubic wavelength of radio broadcast radiation?” (of course, the 
wavelength gets a little large, for that kind), the answer tends to be an 
astronomical number. It is not unusual to see 10°°-10*° quanta per cubic 
wavelength. We have, of course, very efficient sources of such radiation, ones 
which produce vast numbers of essentially identical quanta. So at the lower 
frequency end of the spectrum we have lots of feeble-energy quanta. At the 
other end of the spectrum we have extremely energetic quanta; individual 
ones can do simply catastrophic things to atomic or nuclear systems. We have 
no trouble at all detecting such quanta by almost macroscopic means, but we 
have very very few quanta there at all. Now, in a sense we have still fewer 
quanta if we compare the number of quanta with how many we could have. 
The idea of mode density in the field is considered elsewhere in this chapter. 
It is slightly artificial for optics to consider the field within a finite volume, 
but it is very convenient to do so. It is convenient because it turns the con- 
tinuum of oscillation of the field into a discrete set of modes which can be 
counted. It is a little artificial because we scarcely know what a perfect 
reflector is for, say, y-rays at 100 MeV. But let us imagine such a thing just for 
the sake of defining the modes of the field mathematically. If we do that, the 
density of the modes per unit frequency interval rises as the square of the 
frequency and so the modes become enormously dense at very high frequencies. 
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But as the frequency increases the number of quanta produced by all reason- 
able sources goes down dramatically. Since nearly all the modes are thus 
empty there is a sense in which all high frequency quantum beams are exceed- 
ingly weak. We are familiar, on the other hand, with quantum beams of 
enormous intensity at the low frequency end of the spectrum where we are 
dealing with quantum energies which are exceedingly small compared even to 
thermal excitation energies. We have to go up to frequencies corresponding 
to wavelengths of 0-1 mm, or thereabouts, before we get to quantum energies 
which are comparable even with rather low thermal energies. So, quantum 
phenomena are masked by thermal noise and are not visible in the low 
frequency end of the spectrum which thus remains the natural domain of 
classical theory. We should have a rather hard time indeed persuading a radio 
frequency engineer that there is any need to deal in quantum mechanical terms 
with the output of a broadcast station. On the other hand, we could not be 
more familiar with the use of quantum theory in dealing with the high fre- 
quency region, essentially infra-red frequencies and higher, and that has 
always been the domain of quantum mechanics. 

There are two other qualities of these two regions which are worth mention- 
ing. One is the degree of control we have of the radiation. Our knowledge of 
classical radiation and how to control it in macroscopic terms is about 100 
years old or so and we have achieved a great degree of versatility in being able 
to control radiation of relatively low frequencies. We can make the fields in 
oscillators do just about anything that the Maxwell equations will allow them 
to do, as a function both of the space and time variables. But, what happens, 
crudely speaking, as we proceed to higher frequencies is that we lose more and 
more control over what the field does. At low frequencies we have temporal 
control; we are able to produce, for example, quite monochromatic oscil- 
lations with steady amplitudes. As the frequency rises we sacrifice that possi- 
bility rather quickly. We still have monochromatic radiations available to us 
in the visible region, but only because of the accident of nature that atoms 
happen to have very well-defined frequencies of oscillation. As we go on into 
the X-ray region there is more and more radiation that just comes from the 
continuum of atomic excitations. Beyond the energies of a few million volts, 
we have even gone past the discrete levels of the nuclei and we then deal only 
with continuum radiation. We have virtually no monochromatic sources in 
the extremely high frequency region, except, perhaps the strange one I 
mentioned which involves the Compton scattering of monochromatic visible 
photons at a particular angle. That is just about the only monochromatic 
source that has been proposed in the multi-million volt region. In other words, 
we lose control of the time dependence of the fields as we increase the fre- 
quency. We also lose, somewhat more slowly, but none-the-less positively, 
control of the spatial behaviour of the fields. Optics in the visible region is a 
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field in which we are able to maintain very accurate control of the spatial 
behaviour of the field, for example, in the creation of interference patterns 
and the like. But it is quite difficult at the same time to maintain very accurate 
control over the time dependence of optical fields. Even though optical fields 
may be fairly monochromatic their amplitudes tend to fluctuate uncontroll- 
ably. We have thus to ask: “‘How does the field behave? What does the 
indefiniteness of the frequency of an optical field mean? What kind of time 
variations of the field are responsible for the line-broadening?”” We shall see 
something very specific about that presently. What we shall find is that all 
radiation which does not come from specifically stabilized sources, i.e. all 
natural radiation is, in a fundamental sense, noise. It may have any sort of 
spectrum but it is what the electrical engineer would simply call coloured 
noise. The whole science of optics, in a sense, has just consisted of the con- 
structive use of noise. 

Quantum phenomena are easily observed in the visible region. They are 
observed in the sense that a photon with an energy of a few volts is perfectly 
detectable in a photomultiplier. Not very much work, historically, was done 
by detecting individual quanta in the visible region; the technique was thought 
of mainly as a way of detecting rather weak light beams. It was thought of, in 
other words, only as a means of intensity measurement and there are other 
and simpler ways of measuring low intensities, for example by integrating over 
a long period of time using photographic detection. There was, in all, remark- 
ably little interest in quantum detection. That has changed a good deal within 
the last few years with the entry of a certain amount of technology into the 
picture—technology and a few very clever ideas. 

Historically, the first development based on quantum detection was in an 
experiment by Hanbury-Brown and Twiss in 1956. (To be perfectly precise, 
it was not quantum counting in the original experiment, it was the examination 
of fluctuations in the photo-current, but in all the subsequent experiments it 
has literally been quantum counting.) In counting quanta we can detect 
something more than just the intensity of the field; we can detect correlations 
within the field, and out of that fact has grown a whole idea of statistical 
studies of photons in the field. The techniques of measurement are ones which 
are carried down from the high frequency end of the spectrum. They are the 
techniques of high energy physics, a gift, if you like, from the people who have 
preoccupied themselves with that end of the spectrum. And there is one more 
great development, which is the development of the laser itself. 

The laser radiates in approximately the same way as the classical radiator 
to which we have been accustomed for a long time at the low frequency end 
of the spectrum. That is done by exploiting atomic transitions and macro- 
scopic techniques as well. What results, in effect, is a transfer of the ideas and 
the techniques of the low frequency end of the spectrum upwards in frequency 
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by a factor of perhaps 10° or so. The result is that now there is a region of the 
spectrum extending over the visible frequencies and into the near ultra violet 
in which we can both generate quanta coherently and detect them quantum 
mechanically, i.e. with devices which detect the individual photons. Now, 
lasers are extremely intense sources of essentially identical photons. Such 
sources were previously available only at radio frequencies but, at optical 
frequencies, the photons were so sparse that we never had to think of two 
identical photons as actually being in the field at the same time. That, crudely 
speaking, is still the case at X-ray frequencies and beyond. There are so many 
modes available to distribute the photons into, that we simply never have, to 
any noticeable degree, two photons occupying a single mode of the field at the 
same time, and so the statistical properties of the fields at X-ray frequencies 
and beyond are in a deep sense rather trivial—nothing very interesting goes on. 

The laser produces almost arbitrarily intense photon beams in the visible 
region of the spectrum and experiments with these beams can lead to all sorts 
of quantum populations of the modes of the electromagnetic field. We can use 
quantum detection to investigate the statistical properties of these populations. 
That is, in essence, the change that has taken place. It has been necessary, 
therefore, for theory to respond in a way which permits us to deal fairly easily 
with many-photon problems, The development of the theory, until a few years 
ago, really went no further than the use of classical theory in the low frequency 
domain and at high frequencies quantum mechanics in its simplest form, deal- 
ing with one or two photons at one time. There are many reasons why the 
classical limit of quantum electrodynamics was never very fully developed, 
even though we had every reason to be confident that quantum electro- 
dynamics has the correct classical limit. The most fundamental of these 
reasons is the smallness of the number e?/(4e hc). Its smallness means that we 
can proceed with great accuracy in doing calculations with individual charged 
particles by a perturbation expansion. In point of fact, it has been very difficult 
in the past in quantum electrodynamics (q.e.d.) to do anything other than this 
sort of approximation, but the perturbation expansion becomes vastly more 
difficult as you proceed from one order to the next. The tendency, therefore, 
has been to restrict our attention in q.e.d. to problems involving very few 
quanta at a time, and that is frequently a good approximation. So, if we are 
dealing with individual electrons, as we often are in this region of the 
spectrum, it is quite sufficient to say that we have an electron with only one or 
two photons in the field. If we are terribly ambitious and work for half a life- 
time we might do a problem with three or four photons in the field, but scarce- 
ly more. There was therefore no great enthusiasm, nor was there a great need 
for dealing quantum mechanically with problems in which we had vast num- 
bers of photons in the field at the same time. These days, however, we have 
very little alternative to doing just that in the visible region for the reasons 
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explained above, and that simply means that we must develop other approxi- 
mations. The approximations cannot be precisely the approximations of 
familiar perturbation theory. We have no choice but to deal with very 
considerable numbers of photons at once. 

The sort of optics which existed until recently (an immensely successful 
study) has been largely constructed in classical terms. The description of inter- 
ference experiments, diffraction experiments, and the like, has usually been 
constructed in classical terms that were observed to work, and the advent of 
the quantum theory really lent a certain support to those rather ad hoc 
procedures of calculation in the following sense. It is very easy to calculate 
diffraction patterns in the one-photon approximation. We say the field has 
one photon in it and ask for the probability that one photon is seen in a 
particular position on the screen. Then we have to interpret that in statistical 
terms; we have to imagine that the one-photon experiment has been done 
time and again and ask where, statistically, that photon turns out to be. The 
average density of the photon, the statistical density, then agrees with the 
classical diffraction pattern and there can be no great surprise about that. 
Calculating the expectation value of the field intensity in single-photon states 
can be reduced in fact to much the same mathematical procedure as the 
classical intensity calculation. All of the calculations that we could do in 
single photon terms were quickly found thus to agree with the results of the 
classical calculations and to lend a good deal of support to the notion that the 
classical procedures were correct. 

The kind of experiment we have described gives us a good way of character- 
ising the older optics, the optics prior to the developments of the past few 
years. In all of these traditional optical experiments we imagine a limiting 
process in which the field intensity is allowed to go to zero, but the detection 
time is allowed to become infinite—in other words, the experiment of Taylor 
described by Sillitto in his introductory remarks in this volume. All of the 
traditional optical experiments (perhaps that is a good way of defining tra- 
dition) have the quality that we could go to zero intensity and just increase the 
detection efficiency and we would detect basically the same thing, the same 
diffraction pattern. Those are one-photon experiments. They are the type of 
experiment we can describe by saying there is only one photon in the field at 
a time. But that description does not hold, for example, for the Hanbury- 
Brown-Twiss type of experiment which detects correlations between pairs of 
photons. 

The Hanbury-Brown-Twiss experiment is really the first one in the higher 
frequency range which stays completely outside that tradition. Since that 
experiment, there have been a great many which step outside the tradition 
and which cannot be dealt with at all in terms of one-photon optics. To begin 
with, we can understand in such terms essentially nothing of the functioning 
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of the laser. The laser is intrinsically a non-linear device; it only works when 
the field intensities become so high that the photons know a great deal about 
each other’s presence. That could not be less typical of one-photon optics. 
More generally, the availability of laser beams has made it possible to develop 
a new area of optics, non-linear optics. In non-linear optics, we do dramatic 
things such as sewing pairs of photons together to make new photons of twice 
the frequency. Needless to say, we learn nothing of phenomena like that from 
one-photon optics, and, as a matter of fact, a great deal in the phenomenon of 
harmonic generation depends upon the tendency of photon packets to overlap, 
or not to overlap, in any given beam. What we are beginning to suggest is that 
the description of the electromagnetic field which was used in the traditional 
optics is really inadequate. While we can describe the one-photon experiments 
by means of a very small group of parameters, in general, or a very restricted 
class of correlation functions, to characterise an electromagnetic field more 
fully, and, in particular, to do it in quantum mechanical terms, we need to 
know a great deal more. 


2. CLASSICAL THEORY 


What we shall do now is to develop something of the newer and fuller des- 
cription of the field that we need in order to deal with this much broader 
class of phenomena. Let us do this by first making contact with the classical 
noise theory and let me introduce the notation for the mode functions. This is 
material that Kibble went over (pp. 11-52, this volume). Imagine that we are 
dealing with the field inside an enclosed volume. The specification of the 
boundary conditions of that volume is really not essential to us but we need 
some sort of boundary condition in order to complete the specification of the 
problem. The boundary conditions could be those of a perfectly conducting 
surface; that, however, condemns us to talking about standing waves and 
makes it difficult to deal with progressive waves if we are really dealing with a 
problem in free space in which the very idea of boundaries is unnatural. A 
simpler thing to do is to use periodic boundary conditions. (All we need agree 
upon here is the existence of the mode functions; the boundary conditions are 
really immaterial, that is why they are left unspecified.) 

We consider, then, the field inside a closed volume and, for an appropriate 
set of frequencies ,, we introduce a set of vector mode functions u,(r) with 
mode index k which satisfy the wave equation 


(v2+2) u,(r) = 0, (1) 


the transversality condition 
V ‘ u, (Fr) = 0, (2) 
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and the boundary conditions on the electric fields. The set of mode functions 
are orthogonal if the modes are of different frequency, or they can be made 
orthogonal if they are part of a degenerate subspace. 

The orthonormality condition is 


[ met) we) dr = dy. (3) 


The electric field may be broken into two terms; we shall call them the 
positive and negative frequency parts and define them by convention thus 


E (r,t) = Er, t) + EO(r, 0), (4) 


where E‘*? is the sum of all terms which vary as exp (—ia,f) for all w, > 0. 
All the positive frequency terms are in E‘*? and all the complex conjugate 
terms are in E‘~. Therefore, we have 


EO) = (E“*)*, (5) 


These are classical fields, we are not dealing with operators yet. 
Now we expand a component of E“*? in terms of the set of mode functions 
u, and obtain 


EM (r,t) = ¥ ¢,u,(r) exp (—iw,¢). (6) 
k 


If there is no source connected to the field, these Fourier coefficients @,, are 
constants of motion. We shall deal with a field remote from its source; let us 
say the field is radiated from a distant nebula; we can regard it, to an excellent 
approximation, as being source-free. The field is then described completely by 
this set of coefficients (complex numbers) @,. The question is, ‘““What are 
they?’’ If the field is perfectly well defined, if there is no randomness in it 
whatever, then we know, in principle, precisely what those numbers ¢, are. 
To a communications engineer, a noiseless field would be one for which we 
know all the complex numbers, @. In practice, there is no such thing as a 
completely noiseless field, we do not know the numbers @,, and the most 
accurate specification of a classical field, in statistical terms, can only be 
through a specification of the probability distribution of these complex 
numbers. Let us call the probability distribution A and whenever we are deal- 
ing with a sequence of numbers we shall gather them using face brackets. We 
then have the notation 


PE, C2; sas) = P({G,}). (7) 
The random variables describing the signal, then, are the complex numbers @, 


and the space over which the signal is defined is the space of the complete set 
of complex numbers. An element of area in the subspace corresponding to 
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each complex number can be defined in this way. We define d?@, as the 
product, 
d’@, = d(Re @,) d(Im ¢,). (8) 


The probability distribution has the following normalization 


[ 7c@3 mae, = 1 (9) 


when we integrate over the entire space of the complex Fourier coefficients. 

So we are beginning to develop a classical noise theory. What happens when 
we measure the intensity of the field? The intensity of the field is given by the 
squared modulus of the positive frequency part (say) of the field. That is the 
same as multiplying the negative frequency part by the positive frequency 
part, 

(LEO, 4) hay = {EO NEO, hav. (10) 


This quantity, however, is a random one, and, of course, if we make any 
measurement of a random field, we get a random result. To compare with 
theory, we have to take an ensemble average. This is the meaning of the 
average we have written. We take the average over the random coefficients 
@, and obtain an intensity. When we measure the intensity of a field we are 
measuring the ensemble average. 

More generally, we can use not just one detector to sample the field, but 
use two different detectors and superpose their output. In that case we measure 
an interference pattern which contains the interference of the amplitudes 
which are brought in by those two detectors, classically. The amplitudes super- 
posed in finding the total intensity can correspond to fields at different spatial 
positions or delay times. It is thus useful to define the function 


Goiass(PG Vt’) = {EO'(rt) EO r't’)},,. (11) 


This is a classical correlation function for the field values at two different 
space-time points. It becomes the field intensity if r and r’ are the same and 
the times ¢ and t’ are the same. The field E“ is linear in the random amplitude 
parameters {@,} and E“ is linear in {@,*}. The ensemble average may also 
be expressed by 


GLC) = [PKC ED EMC, S, (12) 


where the integration is over the entire space of random coefficients. The study 
of these random functions of time (E“*?, E>) in which the Fourier coefficients 
have specified probability distributions is really the subject of stochastic pro- 
cesses, Or noise theory. Any such random continuous function of the time is 
called a stochastic process (a continuous stochastic process in this case). There 
is a vast body of theory about these processes and there is really no need to go 
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into further details of that theory here. We have simply noted a few formulae 
for eventual comparison with the quantum theory. 


3. QUANTUM THEORY 


In the quantum theory we deal, not with numbers, but with operators. The 
field is represented by an operator, but that operator can be split into a 
positive and negative frequency part just as the classical c-number fields. 
(c-number is a term invented by Dirac for an ordinary number to distinguish 
ordinary numbers from operators which he called g-numbers.) 


E(r, t) = EO, 1) + EO (r, 2). (13) 


We are no longer dealing with c-number fields, we are dealing with Dirac’s 
q-numbers, his operators. The negative frequency part of the field is the 
Hermitian conjugate of the positive frequency part. This is not simply the 
complex conjugate, but the transposed complex conjugate, i.e. the operator 
analogue of complex conjugation. Thus, we have 


Eo? = (E‘*)* (14) 
i.e. E are mutually adjoint. 

We can make the same mode decomposition of the positive frequency part 
of the field as we did classically. The operator is expanded in terms of the set 
of mode functions as discussed above, and the role of the classical Fourier 
coefficients {@,} is taken by a sequence of quantum mechanical amplitude 
operators. By convention, we write these amplitude operators as {a,} and 
normalize them so that we have E“*? in the form given by 


EM (r,t) =i > Gho,)* au,(r) exp (—ia,t), (15) 


where u,(r) is a vector mode function. The operators a, are the annihilation 
operators, one for each mode of the electromagnetic field. For plane waves, a 
mode of the electromagnetic field is specified by a propagation vector and a 
polarization for the field which is embedded in the mode function. For 
example, for plane wave modes 


u,(r) = é(k) exp (ik . r) 


where é(k) is the polarization vector. The Hermitian conjugates of the annihi- 
lation operators a, are the creation operators a,'. The normalization factors 
are inserted to secure the familiar canonical commutation relations between 
the creation and annihilation operators for the different modes. 


[a;, ay") = One (16) 
[ a, a ] = [a,", ay" = 0. 
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In other words, the one change which we make in going from the classical to 
the quantum theory is that the complex coefficients (ordinary numbers) 
become operators and that these operators obey the familiar commutation 
relations i.e. have the same algebraic properties as the complex amplitude 
operators which are used to describe the quantum mechanical form of the 
simple harmonic oscillator. The commutation relations (16) thus define the 
amplitude operators for an infinite set of oscillators, one for each of the field 
modes. For harmonic oscillators, it is well known that the products a,‘a, have 
as eigenvalues, the integers n, = 0,1, 2, .... The integer 7, is just the number 
of photons in the kth mode. 

It is possible to go through a great deal of theory just constructing equations 
for operators, but, sooner or later, we have to say something about quantum 
states. Physical quantities are always scalar products of quantum state vectors. 
We are concerned with states of the radiation field. The ground state of the 
field is defined as the state |vac>. Here we have used the notation of writing 
vacuum explicitly in the vacuum state because that is the state in which there 
are no quanta in any mode. The vacuum state can also be written as 


| vac) = | {0}. (17) 


Here we have the set of quantum numbers zero in all modes of the field. That 
is to say, if we apply an annihilation operator to the vacuum state, we get 
absolutely nothing because there are no lower energies available; so applying 
the annihilation operator annihilates the state thus, 


a,|vac> = 0. (18) 


We can generate the m quantum states, | {n,}>, by applying the creation 
operators a,' to the vacuum state, thus 


(a,")" 
{7,3 > Il Vn) 
where we see that we construct such a state by creating m quanta in the kth 
mode, then normalizing, and then doing this for all modes. 

We now consider the description of general states of the electromagnetic 
field. The | {7,}> form a complete orthonormal set (we shall see below some 
sets which are not normalized and are certainly not orthogonal) spanning the 
familiar Fock space of quantum field theory. This is a good set of basis vectors 
with which to describe a general state of the electromagnetic field and, indeed, 
it was the only one which was ever used in quantum electrodynamical calcu- 
lations until just a few years ago. Their use was natural enough; in dealing 
with perturbation theory and never with more than two photons in the field, 
we might as well confine ourselves to discussion of these particular states, 


| vac>, (19) 
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which are very simple in their orthonormality properties, and very simple in 
algebraic terms, But the most general state of the field, obviously, is the most 
general superposition of such quantum states, and so we have 


= 2 Fim |{e} > (20) 


The situation is thus rather different in the quantum mechanical specification 
of states from the situation in classical theory. The most precise specification 
in classical theory was achieved by specifying one complex number for every 
mode of the field. Here this notation indicates a set of coefficients with as many 
indices as there are modes. If we had one mode only of the field, we should 
still have an infinite set of coefficients in that specification of a state. For one 
mode, the state may be expanded as a sum of quantum states with arbitrary 
coefficients summed from zn = Oto oo, thus 


I>= Vfl”, (21) 
n=0 
where the only restriction on the f, is the normalization condition, 
LA? = 1. (22) 


Thus, we see that, in quantum theory, there is an infinite set of complex 
numbers which specifies the state of a single mode. This is in contrast to 
classical theory where each mode may be described by a single complex num- 
ber. This shows that there is vastly more freedom in the quantum theory to 
invent states of the world than there is in the classical theory. We cannot think 
of quantum theory and classical theory in one-to-one terms at all. In quantum 
theory there exist whole spaces which have no classical analogues, whatever. 
The correspondence between classical and quantum theory can only be 
drawn in an asymptotic sense, and usually only in the familiar sense of the 
correspondence principle of Bohr, which is to say that as we go to very large 
quantum numbers (i.e. high intensities), we obtain classical results. Now that 
is a slightly strange statement, familiar as it is, in view of some of the things 
we have said earlier, because the close correspondence that was observed long 
ago between classical theory and quantum electrodynamics in the area of 
optics is really concerned with one-quantum phenomena. In traditional optics, 
the light beams have been very weak indeed. Zero point vibrations are very 
strong compared with the strengths of ordinary light beams. (Zero point 
vibrations may be observed in such devices as the parametric amplifier.) So 
we have been dealing with a situation in which the fields are actually quite 
weak. The quantum numbers are very small, and yet that is a situation in 
which classical theory is applicable. This is a remarkable situation from a 
quantum mechanical stand point. We should not expect classical theory to 


QUANTUM THEORY OF COHERENCE 65 


give universally correct answers for situations in which we are dealing with 
such weak fields. The only way in which quantum theory corresponds to 
classical theory is, ordinarily, in the limit of large quantum numbers and very 
strong fields. 


4. INTENSITY AND COINCIDENCE MEASUREMENTS 


Let us consider an ideal device, an ideal photon counter. An ideal photon 
counter is meant to be approximately the simplest thing we can imagine. We 
also simplify the calculations by showing their skeletal characteristics, by 
leaving out many of the detailed steps and just trying to convey something of 
the spirit of the calculation. A photo-detector is a device in which there is, 
typically, a transition when a photon is absorbed. That may seem trivial but it 
is a very deep and a very far reaching characterisation of virtually all the 
photon detection devices that we use. All the practical devices we have, work 
by absorbing photons from the field and that has some very important con- 
sequences for what they detect. In particular, since it is not possible to absorb 
energy from the vacuum fluctuations, it means, for example, that all of these 
devices remain completely insensitive to vacuum fluctuations, and, as we 
mentioned above, that is rather fortunate. As an ideal detector, we shall use 
a single atom. That is a very good detector, though not a very efficient one, 
and we might have to work rather hard with the single atom, but the good 
thing about it is that it is very small compared with the wavelength of visible 
radiation, and so the chances are very good that we can use the electric dipole 
approximation to describe a transition. In this approximation, the interaction 
between the atom and the field is given by 


H, = -e ¥. q,.E@,2) (23) 


where q, is the spatial co-ordinate of the y-th electron relative to the nucleus 

located at r. Because we are using the electric dipole approximation, it is 

sufficient to write the electric field operator just at the nuclear position rather 

than at the position of all the individual electrons. When the atom absorbs a 

photon, it makes a transition from a certain initial state to a certain final state. 
The transition amplitude is given in first order perturbation theory as 


Ay=M.¢fl[EOO, did. (24) 


There are several little details we have left out, for example, we have paid no 
particular attention to the conservation of energy in writing down this transi- 
tion amplitude and we have lumped all of the factors which enter the transition 
amplitude into the factor M which is the atomic matrix element for going 
between the intial and final states, whatever they may be. The initial and final 
state vectors of the electromagnetic field are |i > and |), respectively. The 


66 R. J. GLAUBER 


operator Et) is just the positive frequency part of the electric field i.e. 
E = E“*)+E©). Here we have just retained the positive frequency part; that 
is already an approximation we are making. There are two terms in the 
transition amplitude one of which can easily be resonant and the other one of 
which never is. The atom, let us say, begins in its ground state, that is in the 
situation where only absorption takes place. In our detector we begin with very 
low temperature detectors (all atoms in their ground state). Then all those 
atoms can do is absorb and therefore only the annihilation operator enters 
the calculation. That is true because of other properties of the correct expres- 
sion which we shail come to presently. The conservation of energy is not part 
of this expression because we have not written down its time integral structure 
at all. The conservation comes about because we are dealing with functions 
that oscillate rapidly as a function of time, and as soon as we integrate over a 
considerable number of cycles of oscillation of the internal motions of the 
atom and the field, we find we get zero in the integrals unless there is a precise 
phase matching between the amplitudes of the field and the atomic amplitudes. 
We shall put in the conservation of energy presently but, for the moment, we 
do not consider it. It is that which leads us to select only the absorption part 
(only the annihilation part of the electric field operator) in doing this calcula- 
tion. If we were dealing with a detection interaction which only lasted one or 
two atomic cycles, (that would be a ridiculously short time for a detector, 
10-15 sec or so) then, indeed, we would have to take account of both the 
positive and negative frequency parts of the field. Usually, we deal with 
detection processes which are a factor of 10° or 10’ times slower, and so the 
statement that energy is conserved in the transitions involved has at least an 
accuracy of one part in 10° or 10’, and it is correspondingly that accurate to 
discard the negative frequency part of the field. 

The transition probability (if you are willing to believe that the transition 
amplitude is of the form (24), and in its mathematical skeleton it certainly is 
of that form) is given by 


T= (im? EAE, 17) P| (25) 


av 


where the sum is taken over all final states because when we detect the photon 
we make no measurement of the final state of the field. All possible final states 
of the field can be summed over, but not all conceivable final states of the field 
are reached in this transition; no matter how many photons there are in the 
initial state, only one is taken away by the absorption transition. Since we are 
not observing the final state, we may as well sum over all of the possible final 
states. All we have said is that most of them do not contribute to the sum but 
we may as well throw them all in to carry out the sum (mathematically that is 
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simplest by far). If we do that, we obtain 
Ty = {IMP Y GEOR DIP CFIEMG, 1] i>| (26) 


f 
Again the summation is taken over final states. However, the completeness 
relation for the set of all possible final states is that the sum over final states is 
just the unit operator, thus we have 


LIs><Sfl = 1. (27) 
From (26) and (27) we get 


Ty: = (1M[?<i| EO, 1) EO, 1) | > hav. (28) 


Another way of writing this expression is to think of the outer product (the 
dyadic product) of the two state vectors as a kind of operator in Hilbert space. 
We then obtain . 


Tr: = |M|? Tr} {[i> <i| Jv EU, NEM 1)), (29) 


Here we have written the two state vectors |/> and <i| side by side and we are 
now dealing with something which has a square array of components if we 
take matrix elements. The structure of (29) (precisely the expression (28)) is 
such that we are evaluating its trace. 

We have assumed here that we know the initial state of the electromagnetic 
field. However, in practice, we rarely have this information. The devices that 
radiate are always systems with many particles and the most we ever know 
about them is their statistical behaviour. Thus we are always uncertain of the 
_ initial state of the field and the transition probability which we measure is 
something that is defined only in a statistical sense. We are making a measure- 
ment with one atom and we get a particular number for the length of time it 
takes that atom to make a particular transition in a field ; that number will vary 
from one repetition of the experiment to another. We have to keep repeating 
the experiment; in other words, we evaluate an ensemble average. That means 
that we must average (29) over the uncertain initial states of the field. The 
advantage of the somewhat cumbersome notation of (29) is that we have 
isolated the two quantities over which we are averaging. We are averaging the 
dyadic product. 

Let us define the density operator, p, by the dyadic product averaged over 
many preparations of the field. We take whatever system it is which produces 


~ What we have done, is simply to write out the absolute value of the matrix element 
squared using the familiar theorem that the complex conjugate of a matrix element is the 
matrix element with the states written in the reverse order of that in which the hermitian 
conjugate of the operators were taken. 
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the field and let it produce that field time and time again, thus producing a 
certain ensemble of initial states of the field. We then average the dyadic 
product over those indefinite initial states of the field and thus we determine 
the density operator 


P= {ld<il}ay (30) 
The field intensity that we measure is I(r, ¢) and is given by 
Ia, t) = Tr{pEO(r, t)E(r, t)}. (31) 


The intensity is the trace of the density operator times the product of the two 
field operators, the negative frequency part times the positive frequency part. 
This explicit structure is very important. Notice that the negative frequency 
part contains all the creation operators, a,', when we are quantizing the field 
and that the positive frequency part contains all the annihilation operators 
a,. It follows then that in the vacuum state we get 


p = | vac><vac | 
Tr(pE}E™) = <vac| EOE | vac) = 0. (32) 


Thus, in the vacuum state the intensity is zero; this is a consequence of the 
normal ordering of the product of field operators (i.e. the fact that annihila- 
tion operators always stand to the right of creation operators). The vacuum 
state is virtually the only case in which we ever really have a pure state. 

The fact that the intensity in vacuum vanishes is very important in avoiding 
the kind of confusion which arises when the zero-point energy (the zero-point 
oscillation) is mixed in with observations. If we write expressions other than 
the normally ordered one, we inevitably run into problems with the zero-point 
oscillations. 

Let us now examine the structure of the kind of statistical average which 
measures the intensity of the field. Firstly, we shall abbreviate the coordinates 
by using x instead of a position and time variable. We can define a first order 
correlation function of the field by 


GO (x4, x2) = Tr{pEO (x )EM(x,)}. (33 
Here it is defined for two arbitrary space-time points, 
¥3(¥1, 3), X2(Fo, ta). 


Had we done the perturbation calculation more fully, including the effects of 
energy conservation (i.e., putting the time integrals and everything else in) we 
should have found for the transition probability (the probability that by time 
t, a single atom undergoes a photoabsorption process) an expression which 
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looks like this more generally 
t t 
p(t) = { ar’ { dt" A(t’—t") GV(rt', rt”). (34) 
0 0 


This is still an electric dipole approximation, where Y is a sensitivity factor 
which contains the atomic matrix elements. We should find that the process 
is not local in the time variable. There are two different times in this expression 
(t’ and t’’). The individual transition amplitude is a time integral and when we 
square the transition amplitude, the probability obtained is a double time 
integral. Typically, our device has differing sensitivities for various frequencies 
of light quanta. All the different sensitivities of the device are easily accom- 
modated in this function “, which can take a variety of forms, and which we 
shall not discuss here in any detail. (A more complete discussion of photon 
counting probabilities and sensitivity problems is given in the Les Houches 
notes, 1964.) For the simplest case of a detector which does not discriminate 
between frequencies, is a delta function. 
For this broad band counter, we have 


S(t'—t") © s5(t'—t"). (35) 


This cannot be a precise statement because the function s has a one-sided 
frequency spectrum, but it can be an excellent approximation relative to the 
relaxation times of the actual field, (i.e. relative to the characteristic times cf 
the function G"). For this case, with a broad band detector} the integral is a 
little simpler. The sensitivity is described by a constant and the double integral 
becomes a single integral. 

Then we can think of the absorption process as being local in time, that is, 
it involves only one time variable. 


t 
PrPYM=s | dt’'G rt’, rt’). (36) 
0 


The rate at which the photoabsorption takes place is given by 
w(t) = (d/dt) pP(t) = sG (rt, rt) 
= sI(rt) (37) 
The electric dipole approximation has neglected the size of the atom and for 
this reason we have evaluated the field at the position of the nucleus. If we 


took the size of the atom into account, then we really could not use the electric 
dipole approximation, and we should probably then have written these cor- 


+ A broad band detector is one whose sensitivity as a function of frequency is flatter than 
the frequency distribution of the radiation to be detected. Typically this radiation is in 
rather a narrow band. 
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relation functions in terms of the vector potential rather than the electric field 
operator, but their structure would be identical. We should find, much more 
generally, that these transition probabilities involve volume integrals over the 
correlation functions. If we were taking the size of the atom into account then 
we would have the function G(r, r’) and we would have to integrate over the 
variables r and r’ as well. There would be two more integrals added to the 
expression and, of course, then there would be the atomic wave functions in 
the expression. All of these are complications which we really do not need in 
order to understand the essential points of the theory. 

Suppose we have n atoms forming n different broad band detectors with a 
shutter in front of each atom. We shall open all the shutters at time 0 and let 
each of them remain open until time t; where j = 1, 2, ...,”. The probability, 
that all n atoms have undergone the photoabsorption transition and the 
transition has been detected for each of them is given by, 


tt t2 tn 
p(t, ...t,) = | an | ai dt,’ 
0 it] 0 


x Tr{pEO ry ty’)... EO(ty ty’) EO (ty ty’). Er, ty}. (38) 


This is the sort of structure which can be obtained, formally at least, only in 
n-th order perturbation theory. But, n-th order perturbation theory is really 
only an n-fold iteration of first order perturbation theory. There is not really 
very much added. In other words, to absorb n photons, we require 7 annihila- 
tion operators. We square the matrix element, therefore we have n creation 
operators (the negative frequency components of the field). We still have the 
structure we had before in terms of the unknown initial state of the field which 
has to be averaged over, so we include the density operator. Finally, we have 
n integrals over n time variables. Each photoabsorption process has been 
made local in each time variable by the assumption that each atom is a broad- 
band detector. We may define an n-th order correlation function by 


G(X. Xaq) = Tr{pEO (x1)... EO(%,) EO (4 4.1) EMP (x2,)}. (39) 


Again we note that it is a product of 2n field operators written explicitly in 
normal order. The annihilation operators (” of them) standing to the right, 
the creation operators (n of them) standing to the left. Now, having defined 
these functions, there are a great many identities we could prove about them, 
but we won’t need any of them for our present purposes. However, one 
obvious thing to point out is this. We can define a kind of joint counting rate 
when we have more than one photon counter in the field by differentiating 
(39) (the probability that by the succession of times ¢,, ..., t, all of the atoms 
have undergone photoabsorption processes which have been detected) with 
respect to all of the times ¢,, ..., t,. If we do that, we turn the n-fold integral 
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WL... t,) = ODM (I, ... t,)/Ot, ... Ot, 
= S"GO (nyt, 00. By fg Vn fan os Bits) (40) 


The expression is therefore the n-fold correlation function, but notice that the 
arguments r,f,,..., 1,4, are repeated. This n-th order correlation function is 
indeed arrived at in perturbation theoretical terms by an n-th order iteration 
of first order perturbation theory. But, on the other hand, the result is different 
from simply the product of first-order perturbation theory taken with itself 
n times. To put it a little differently, the m-th order correlation function, even 
with its arguments repeated in this way, does not generally factorize. Physi- 
cally, this non-factorization is caused by interference effects which take place 
in the absorption of photons by the different counters. We cannot generally 
tell which photon is absorbed by which counter and all of the interference 
terms are implicitly summed in the expression we have written involving the 
trace with the 2n factors of the field. The interference terms would all be 
omitted if we simply took the product of 7 first-order correlation functions to 
represent the right-hand side of (40). The first order correlation function with 
its arguments repeated is a counting rate for a single counter, and what we are 
saying is that the n-fold joint counting rate does not factorize into the product 
of the counting rates that we would measure with each of these counters 
present in the absence of all the others. In other words, there is generally a 
tendency towards statistical correlation (correlation, of course, including the 
idea of anti-correlation as well). There is generally a lack of statistical inde- 
pendence in the counting rates of the m counters in the field. 

We can imagine using other sorts of detectors, one possibility suggested by 
Mandel among others, is to use the process of stimulated emission as a 
basis for detection. We could put into the field a collection of atoms which are 
not in their ground state but in the excited state. In this case, the sort of argu- 
ment we have given would still hold but the first thing we would have to do is 
to emit photons rather than absorb them and so the operators would occur in 
anti-normal order rather than normal order, thus 


ti tn 
p(t, 6 ty) = sf dz,’ mi) Tr{ pEP Wy t,) ... EO, t)}. (41) 
) 0 


The operators occur in anti-normal order for a detector based on stimulated 
emission. There are certain practical problems in constructing a detector based 
on stimulated emission. It is perfectly true that the atoms in an excited state 
will go to the ground state a little faster when we shine light upon them, and 
that in principle is detectabie, but most of what we would detect would be just 
the noise of spontaneous emission. The effect of shining the light on these 
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spontaneously decaying atoms would be rather small so such a detector is not 
very practical. The point of this argument is simply to emphasise that different 
operator orderings are associated with different sorts of experiments; it is the 
experiment which determines the ordering which is convenient. In particular, 
the stimulated emission device would use anti-normal order. Similarly, we 
could conceive a device based on scattering. The photons, in being scattered 
from free charged particles, would make them recoil. Detecting the recoiling 
particles would be another form of photon detection, a very inefficient one 
because the cross-sections are small, but one that is used at high energies in 
the Compton effect detectors. This detector measures a sort of combination 
which is different again, one which is not at all as simple either as normal 
ordering or anti-normal ordering. At low frequencies we can probably say that 
what it detects corresponds more closely to the symmetrical ordering of the 
annihilation and creation operators. The experiment determines the order. 
The ordering really does mean something, at least in terms of the convenience 
of describing particular experiments. Of course, we can always change the 
orders of operators by using commutation relations but we may increase the 
complication of the expressions considered by doing that. Typically, for the 
experiments which have a null result in vacuum, we ought to use normal order; 
that is the convenient one. For most of what we say, normal ordering will play 
a particularly important role. 

We have not exhausted the possibilities of defining correlation functions. 
There are many expressions we can define. For example, we can define a new 
set of correlation functions based on anti-normal order, or any kind of funny 
order we please. We can, furthermore, define more correlation functions 
which are based on normal order, and there has been a certain amount of 
discussion of those, and so let us just write the definition. Here is such a 
definition, 


G05 0.5 XpV1 oes Ven) 
= Tr{pEO(x,) ... EO (x) EP)... Ey) }- (42) 


This function differs in general from the normally ordered correlation func- 
tions we have written earlier by including the possibility of having different 
numbers of annihilation and creation operators. For n = m we have 


Gem = G, (43) 


Expressions of this sort may be defined, but what we find under most of the 
conditions we study, is that these additional expressions for n # m (i.e. the 
extra ones gained by the definition (42)) typically vanish. Let us see why this 
is SO. 

Most optical experiments are performed in essentially a steady state sort of 
light beam. A steady stochastic process is called a stationary stochastic process. 
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This does not mean that nothing happens as a function of time, what it means 
is that all of the statistical statements that we make are independent of the 
origin of time. That is, our statistical knowledge of whatever time dependent 
behaviour may be going on is invariant under time displacement. That is what 
stationarity means for us. Now we can be talking about fields which are oscil- 
lating very rapidly; there is no question about the presence of the oscillation, 
but what we know about the oscillations is independent of the time of day. For 
example, for a stationary field, the correlation function which generally de- 
pends on two times, ¢, and f,, is invariant under time displacement. This 
means if we add a time t to both arguments, we get the same function for all 
times, 7, 


GY (t,t,) = GO(t, +7, t2+7) 
= Gt, —t). (44) 


That means, in practice, that the correlation function depends only on the 
difference of the two times. Now, of course, all statistical statements about a 
stationary field have to be time-independent and that means that there must be 
a corresponding property for the higher order correlation functions as well. So 
here, for the general function G””™, we may write the same statement 


COME AT, sors teem tT) = EOE os tem (45) 


Now what does that statement mean from a microscopic standpoint? What 
does it mean about the density operator for the field? Considering any expec- 
tation value in the Heisenberg picture, the trace of the density operator times 
some product of operators (that, of course, is the structure of all our correla- 
tion functions) we get, 


Tr{ pA(t, +7) B(t, +17)... O(t, +7) 
= Tr{p exp (—iH1/h) A(t,) B(t,) ... O(t,) exp (iH7/h)}. (46) 


Here, the operators are displaced in time. In the Heisenberg picture, we use an 
exponential based on the Hamiltonian operator to displace operators in time; 
thus we obtain the right-hand side of (46). So displacement by time, t, means 
performing a familiar translation displacement under time, a translation on all 
of the operators, and when we perform this same unitary transformation on 
all of the operators, of course all the exponentials cancel except the first one, 
and the last one. Now, the trace has cyclic symmetry, so the last exponential 
can be moved around freely to the first position, and the right-hand side of 
(46) becomes 

Tr{exp (iHt/h)p exp (—iHt/h) A(t,) ... Q(t,)}. (47) 


All of the remaining operators are evaluated at their original times ¢,, ..., f,. 
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We can write (47) in the form 


Tr{pA(t,) -.. O(t,)}, (48) 


provided the transformed version of the density operator is equal to the 
density operator itself. In other words, provided the Hamiltonian commutes 
with the density operator, 

[H, p]=0. 


Commutation of the density operator with the Hamiltonian is the necessary 
and sufficient condition, finally, for stationarity. 

Now supposing we have a stationary field, let us see what we can say about 
the correlation functions, and, in particular, these correlation functions with 
different numbers of creation and annihilation operators. Suppose the field is 
one in which the set of modes which we label by “k” is occupied. The first of 
the more general correlation functions may be constructed as one in which we 
have no creation operators, and one annihilation operator. It then just has one 
time argument. This function is, of course, the trace of p times E+ at the 
time ¢. It is just the expectation value of the field. The form it takes is neces- 
sarily this, 

GO Y(t) = Tr(pE(t)) = ¥, F, exp (—ia,t). (49) 
k 


If we have certain modes of the field which are occupied, we imagine the mode 
decomposition of the operator E“*? and calculate the expectation value and, 
indeed, we have a sum of periodic terms which can only be the exponentials 
associated with the time dependences of the occupied modes. Now, we are 
assuming that the field is stationary, and if the field is stationary, we simply 
cannot have oscillations like that. Everyone of those coefficients F, must 
vanish. If the field is to be stationary, the only possibility is that the correlation 
function G® (t) vanishes. That is not surprising. For a stationary field, the 
expectation value of the field amplitude itself must vanish, so the first of these 
more general correlation functions vanishes. 

Now, how about the other correlation functions? It is always possible that 
we have equal numbers of creation and annihilation operators to pair-off the 
oscillating time-dependencies and in that way to get a time-independent 
function. However, if we have unequal numbers of creation and annihilation 
operators, we are dealing with a condition which is very much more difficult to 
satisfy. The more general situation for a field with {k} modes occupied is, 


nt+m 


Gon (t, aoe te+m) = » wea by Lae ere exp E 2, its y av4| | oo 
4 a 


n-+m 


In general, the (n, m)th order correlation function is a summation for which in 
all the terms of the summand, there occur all possible choices of the expo- 
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nential time dependencies which characterize the modes: there appears in the 
positive frequency sense, m such factors, and in the negative frequency sense 
there appear 7 such factors. If there is any unbalanced oscillation in these two 
exponents, and if we are asserting that the field is stationary, viz. that this 
function is independent of time displacements, then the only way that that can 
be secured is to say that the corresponding coefficient vanishes. A necessary 
condition, then, to havea non-vanishing function G™ is that it be possible to 
put together sets of frequencies of n frequencies and m frequencies, (and we 
can allow repetitions), so that 


nt+m 


n 

Yo,= ¥ ow, (51) 

j=l {=n+1 
We take the set of occupied modes, write down their frequencies, and choose 
n of them with possible repetitions, and choose m of them allowing for possible 
repetitions. However, if it is not possible to do that so that the one sum is 
precisely equal to the other, then there is going to be unbalanced oscillation 
present in the corresponding term for the correlation function, and that co- 
efficient must vanish if the field is to be stationary. That means that the simp- 
lest way of securing a non-vanishing correlation function for a stationary field 
is just to have m = n and to make identical choices of the sets of frequencies. 
Then the one time dependence will cancel the other and it is very easy to have 
a non-vanishing function. It is not so easy if we choose m # n. To satisfy a 
condition of this sort a very special set of frequencies must be occupied in the 
field. For example, if we have three frequencies present, then we see that if we 
call them w,, w, and w3 we must have some such relationship as this, 


1 +a, = 3, (52) 


among the three frequencies, if we are to get a non-vanishing result for what 
we have defined as G‘" ?). In fact, a condition of this sort can hold, forexample, 
in the parametric amplifier, or in the coherent Raman effect where the modes 
excited have this sort of property. But this is only a necessary condition in 
order to have a non-vanishing correlation function for these peculiar orders. 
It is only a necessary condition because, in fact, if these modes at different 
frequency oscillated independently of one another, we should have to look 
further at the structure of the coefficients, and then we still should not find 
any which were different from zero under the stationarity condition. There 
has, additionally, to be a statistical correlation between the density operators 
for these particular modes. If, on the other hand it were true that the density 
operator for the entire field factored into a product of density operators 
(stationary density operators for the individual modes) once again, we could 
show that the functions vanish. The actual physical situation in devices like 
the parametric amplifier or a medium in which the coherent Raman effect is 
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amplified, or a substance in which second harmonic production is occurring, 
is one in which there do exist such statistical correlations between the modes 
of different frequencies. And so, it is true that these peculiar correlation 
functions take on non-vanishing values. 

In the case of harmonic production which is the simplest, we send in red 
photons, and pairs of red photons join together to form blue photons. In the 
outgoing field we have two components present, one of which is at precisely 
twice the frequency of the other. Furthermore, there are phase relations be- 
tween the two fields. There is statistical dependence between these two fields, 
and, in fact, in the region illuminated by such a crystal producing the second 
harmonic, we should find that the function G“ ?) is different from zero. But it 
takes all of the complication of non-linear optics to bring that situation about. 
If we just produced two fields from independent lasers, say, which happened 
to have frequencies such that the one had precisely twice the frequency of the 
other, the function G"’ ?) would still vanish, because there would then be no 
phase relations between the two fields. The dynamical process of harmonic 
generation is, here, precisely that which does generate the phase relations to 
give a non-vanishing function. 

It is interesting to consider how we would detect the function G”), We 
would have to perform a curious sort of interference experiment in which an 
amplitude based on (E‘*)? was somehow or other able to interfere with an 
amplitude based on a single factor of the field E‘. It is not impossible to do 
that and we may consider the following experiment as a suggestion as to how 
it may be done. Figure 1 shows a device which is producing harmonics. The 
problem is to make the two beams R, B interfere. There is a variety of ways. 
We can engage in harmonic production once more on the red beam. Of course, 
we shall then get some of the blue light through and a little bit more. The red 
light will go through and produce some blue light. This is a species of inter- 
ference experiment because we can now let the blue beams interfere with one 
another. However, if we do that, we are measuring G which is something 
quartic in the field and it is not what is wanted. This G® is the average of 
(E“)? (E“)?, What we have to do is some kind of experiment which literally 
measures the high frequency field against the low frequency field. It has to be 
an interference experiment of some sort. A possibility is the following. Sup- 
pose we use an atom as a photon detector and examine the photoelectrons 
emitted. Suppose also that atom has a threshold for photon detection which 
lies higher in frequency than the red frequency used. A single red photon will 
not give a photo-electron, but two red photons will. Now, two photons can be 
observed simultaneously and here we have a good strong beam of red photons, 
so there will be some photoelectric effect and that photoelectric effect will occur 
only in second order; it will be proportional to (E“)*. Alternatively, the blue 
photons, which have, each of them, twice the frequency, can make the photo- 
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electric transition occur by the absorption of a single quantum. These two 
processes are going to interfere with one another, and the amplitude of the 
interference term for the two processes will be proportional to the statistical 
average of E,'~) Ep’*? Ep *, i.e. 


Tr(pE,~ E,‘*? E,‘*). 


As we move the photodetector which is based on the interference of two 
photon absorption versus single photon absorption, we shall discover inter- 
ference terms just because of a small difference of propagation vectors which 
typically occurs. The propagation vector of the blue photons will not te 


Fic. 1. Second harmonic production in a medium; some red light is converted to blue light; 
some red light is transmitted. 


precisely equal to twice the propagation vector of the red photons (kg # 2k, 
in general) because of dispersion. Therefore, as we move the detector based on 
the interference of these two different sorts of absorption processes, we shall 
find fringes in the interference of the two absorption processes. So, in prin- 
ciple, this is a possible measurement. 

Another possibility is to have two converters as shown in Fig. 2. Many red 
photons enter the system, some are converted to blue ones, and some remain 
red. We have a second converter located as shown and some of the remaining 
red ones are turned blue. Finally, we have a blue filter, which removes the red. 
An ordinary photodetector is used. The total field between the two converters 
is the sum of the blue and red components. The final intensity measurement is 
G in the region beyond the converters, and is given by 


GY = Trl pEMOEON, (53) 


This expression is quadratic in the field E’’. The field E” in the region beyond 
the second converter contains the more-or-less unchanged field E,’ and a part 
which is proportional to the square of the red component, thus we have 


E’@) = E,' + const (E,’*)?, (54) 


If we substitute the sum of these two terms for the doubly primed fields in the 
intensity we measure, we see that the function G™ contains E,’“~, E,’“*?, 
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ie. G contains 
Tr{ pE,~ Ey’ E’,(*}. 


In other words, G") contains the function G2) when we are using fields 
between the two converters. There are other ways of constructing the intensity 
measured finally. For example, we can go back to the original fields and write 
the intensity in terms of those, but then it becomes a quartic expression. So it 
really all depends on which fields we are considering, whether we are measuring 
one order of the correlation function, or another. Viewed in these terms, the 
measurement made is one of the (2, 2) correlation function, or the ordinary 2nd 
order correlation function when dealing with fields before they have entered 
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the system. For fields before entering the converters, G contains the second 
order correlation function, 


Tr{ pEOE OEM EO? = G22) = G, (55) 


So in this sense, it is a matter of definition, just which of these correlation 
functions we are considering. But if we are dealing with the fields in between 
the two converters, then we can view the combination of the last converter, the 
filter, and the photodetector as being equivalent to the single atom photo- 
detector with the high threshold which we considered above. 

We have indicated that the higher order correlation functions with the dif- 
fering numbers of creation and annihilation operators can, indeed, be different 
from zero and be measurable. However, this is not done to give too heavy an 
emphasis to these curious sorts of correlation functions; in fact under all 
ordinary experimental circumstances in stationary fields (whenever we have a 
steady beam, we effectively have a stationary field) for m 4 n, they vanish. 
Therefore, it is quite sufficient to retain the simpler set of correlation functions 
and deal just with those i.e. the ones for which the number of annihilation 
operators is equal to the number of creation operators. In practice those are 
almost always the functions which we measure. 
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We have not said a word about coherence yet, so let us go back to the most 
elementary considerations and Jook at the first sort of experiment in which we 
would propose to use the term coherence. This goes way back to optics. 
Figure 3 is a crude representation of Young’s experiment. The screen has two 
pinholes which are very tiny because we do not want to calculate the diffrac- 
tion pattern of each hole. For all we care, the wave which spreads behind each 
of these holes is perfectly isotropic. We are looking at the intensity on a distant 


Sy 


Fic. 3. Young’s experiment. 


screen at a point r, ¢. The field at r is, to a good approximation by Huygens’ 
principle, a superposition of the fields at these two pinholes. 

The point considered lies at two different distances from the two pinholes 
and so we are effectively sampling the fields at these two pinholes at two dif- 
ferent times. The distances are s, and s, and the times are [t—(s,/c)] and 
[t —(s,/c)]. The field is given by 


EO, t) = 1{B(, i 4 £%(r 1 =}, (56) 


All of diffraction theory is bound up in the parameter A which tells us the 
transmission of the hole. (We are assuming that this is independent of the 
angle at which these propagation vectors lie on the other side of the hole). We 
are taking the positive frequency component of the field. Abbreviating the 
appropriately retarded space-time points by XY, and X,, we get 


EO(r, 1) = A EO(X ) + EO(X,)}. (37) 


The intensity is found from the correlation function by evaluating the fields 
E™) and E~ at the same space-time point which is the space-time point on 


the screen, thus 
I(r, t) = Tr{pEO (rt) EO rt)}. (58) 


But each of these fields is a superposition of the field operators (i.e. a sum) 
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corresponding to the appropriately retarded times at the pinholes. Substitu- 
ting the two sums, we get 


I(r, t) = |A|? Tr{p( EO (X41) + EX) (EX + EO'(X2))}. (59) 


This is a sum of correlation functions. Apart from the constant 4? which con- 
tains all the physical optics, it is the sum of four correlation functions, 


I(r, t) = [A|2{GP(X,, X)+ G(X, X)+ G(X, X.) + GO(X,, X)}. (60) 


The function G(X ,, X,) is the intensity obtained if the second hole is closed. 
The function G)(X,, X,) is the intensity if the first hole is closed. Then we 
have two terms which exist only because both holes are open. (We cannot tell 
which hole any given photon went through). These two terms are complex 
conjugates of one another so they are just twice the real part of G“ with the 
arguments in either order, 


G(X 1, X2)+ G(Xp, X1) = 2 Re G(X, X2). (61) 


If we write the function as its absolute value times an exponential of unit 
modulus (i.e. e'*), then the real part is just |G°?(X,, X.)| times the cosine of 
the phase factor @¢, 


I(r,t) = [2 |?{GP(X ,, X) + G(X 4, X2) + 2|GO(X ,, Xo)| cos (X,,X,)}, (62) 


where we have taken G“ = |G“ e'#. Let us now consider what happens as we 
move the point of observation up and down the screen. The positions of the 
holes at r, and r, (the spatial parts) of the space-time position X, and X,, 
remain fixed. As the two distances s, and s, vary the times at which we sample 
the fields vary. The absolute value remains positive and represents the envelope 
of the sort of wiggle which is associated with the cosine. The wiggle represents 
the fringes; the intensity goes up and down. The rising and falling of the 
intensity (i.e. the degree of contrast) is governed by |G"(X,, X,)|. The func- 
tion G(X,, X,) is a correlation function. If it vanishes, it means that the 
fields at the two pinholes are uncorrelated. If there is no correlation, we say 
that the light at the two points is incoherent at the one point relative to the 
other (i.e. relatively incoherent). On the other hand, increase in coherence 
results in an increase in fringe contrast, i.e. the amplitude of the wiggles 
becomes large. Since |cos ¢| is never greater than unity, the fringe contrast is 
maximised by making |G“(X,, X,)| as large as possible, and that signifies 
maximum coherence. There is an upper bound because the total intensity of 
the expression can never become negative. It is very important that we have 
the intensity terms G(X ,, X,) and G(X, X,) because they are intrinsically 
positive whereas the interference term oscillates between positive and negative 
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values. The upper bound to |G“(X,, X,)| is given by the Schwarz inequality, 
G(X, X1) GM(Xy, X_) > |GM(X,, X,)]?. (63) 


This inequality can be proved easily from the positive definite character of the 
density operator; the fact that the density operator can never have negative 
eigenvalues. The squared modulus of the cross-correlation function has to be 
less than, or equal to the product of the two intensities. We secure maximum 
fringe contrast by using the “equals” sign. 


G(X, X1) GOK 2, Xp) = |EM(X,, X2)]?. (64) 


That is a perfectly good characterization of optical coherence as it has always 
been used in ali of the older contexts of optics. Let us agree that that is what 
we mean by optical coherence. 

The fields are coherent at two space-time points (X,, X,) if the equality 
holds and, furthermore, we have optical coherence of the entire field if this 
equality holds for all space-time points X, and X. This is a slightly peculiar 
looking condition, and it is not immediately obvious how it is satisfied. Let us 
cite one sufficient condition for satisfying the optical coherence condition. We 
shall call the optical coherence condition (64) the first order coherence con- 
dition because it deals only with the first order correlation function. We have 
already seen that there is a hierarchy of correlation functions and we shall talk 
about other sorts of optical coherence in connection with those below. Sup- 
pose that the first order correlation function factorizes in the form, 


G(X Xp) = &*(X) &(X2). (65) 


If we write it in any factored form and then use the fact that the function with 
its arguments reversed is the complex conjugate of the function, then we see 
immediately that the two factors must be complex conjugates of one another. 
Furthermore, the field which represents one of the factors must be a positive 
frequency field, that is & contains exp (—iw, ¢). If this factorization condition 
holds, then by just noting that G(X ,, X,) is the squared modulus of this field 
and G(X,, X,) is likewise, we see that the condition for maximum fringe con- 
trast holds identically. Obviously, this is a sufficient condition for optical, or 
first order, coherence. What is a good deal less obvious, but nonetheless true, 
is that if the maximum fringe contrast condition holds everywhere in space— 
time, then it implies the factorization condition. It does not take any deep 
mathematics to prove that, it is just the Schwarz inequality but it does take 
about ten minutes to prove. It is proved in the Les Houches notes, and it is 
also proved by Titulaer and Glauber“. This definition of optical coherence 
goes a little bit beyond the definition that had earlier been used in optics 
because of the usual limitation of experimenting with stationary light beams, 
in fact, the usual definition of the correlation function in optical terms was 
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made in terms of a time average rather than an ensemble average. If we are 
dealing with a time average, we have only one time left in the expression, so it 
is not easy to see then how to extend the definition to deal with non-stationary 
fields. The new definition is a definition which extends immediately to non- 
stationary fields. It is also a quantum mechanical one rather than a classical 
one. 

One of the nice properties of the statement of factorization, once we know 
that the statement of factorization is equivalent to the statement of maximum 
fringe contrast, is that we can see clearly what the relationship is between 
monochromaticity and optical coherence. As we know, all of the efforts over 
many years to increase optical coherence were efforts to provide more and 
more monochromatic optical fields. The curious thing about the definition we 
have given here is that it says absolutely nothing about monochromaticity. In 
fact these fields & into which the correlation function factorizes can have 
absolutely arbitrary spectra. So an optically coherent field according to our 
definition need not be monochromatic at all. However, when we restrict our- 
selves to dealing with stationary fields, then indeed, the correlation function 
must be invariant under time displacements, i.e. it must depend only on the 
difference of two times 


Gt), t2) = Gt, — ty) = E*(t,) E(t). (66) 


If we require then that it factorizes as well, that is, if we impose the first-order 
coherence condition, then the only solution of this functional equation (the 
only way of writing a product which depends only on the difference in the two 
times) is to say that the field & depends exponentially on the time, and then 
since it must be a positive frequency field, it can only take the form 


E(t) = &(0) exp (— iat). (67) 


Are there any fields that obey our optical coherence condition precisely? One 
trivial sort of example is just a classical field which is perfectly well determined 
in its behaviour, one whose Fourier coefficients are absolutely fixed. Then the 
procedure of averaging over unknown Fourier coefficients referred to above 
becomes a trivial one and it is clear that the correlation function factorizes in 
this case. Of course, we are dealing there with the classical correlation function 
and it is that that factorizes. We have said nothing about the quantum mech- 
anical one and we leave open for the moment the question of whether there are 
quantum mechanical fields which do this; as we know, there is a vast variety of 
them. The reason we mention this particular classical example is to show the 
first of the many associations we shall see between the definiteness of the field, 
the fixed character of the (otherwise) random Fourier coefficients and coher- 
ence. Coherence tends to limit the noise in the field. The more coherence, we 
shall see, the less noise is allowable. Optical coherence is a rather weak limita- 
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tion on noise in the field. We can have quite noisy fields which are still con- 
sistent with the optical coherence condition and only by adding further 
conditions can we get rid of much more noise. 

We can go on; having defined first order coherence, and being equipped 
with a great many correlation functions, we can define more exotic sorts of 
coherence. An obvious thing to do is to look at the higher order correlation 
functions and require that they factorize in essentially the same way. Let us 
assume that the first m correlation functions 1, ..., m all factorize in essentially 
this way, 


G(X, 0.5 Xam) = 6*(X1)... E*(Xp,) (Xmas) (Xam) (68) 


for m < n, where the function & is the same in all cases and is not an operator, 
it is an ordinary function which is a positive frequency solution of the Maxwell 
equations that obeys all the boundary conditions which the field must obey. A 
field which obeys this condition for the first n correlation functions will be 
called nth order coherent. There is room for some variation in the choice of 
the particular definition of nth order coherence; some other suggestions have 
been made. Cases in which we have, in practice, fields with precisely 17th 
order coherence (say), and no higher coherence are not known; it is intended 
to suggest by this definition that we are adding more and more conditions and, 
indeed, when we let n > 00, we shall then be talking about a fully coherent 
field; that is the other significant extreme. If nth order coherence holds, then 
(68) implies that all of the correlation functions which represent joint counting 
rates for m-fold coincidence experiments and m < n factorize in this simple 
way: 


G™(X,, eeey X mn X sees X;) =i GX, X ;). (69) 


Notice that the arguments are repeated here, X, occurs twice, and finally X,, 
occurs twice. In other words, fields with higher order coherence are the fields 
for which the registry of counts by the individual photon counters is statisti- 
cally independent. We put n different counters in the field and they will each 
record photons in a way which is statistically independent of all of the others 
with no special tendency towards coincidences, or correlations, or anti- 
correlations, except as is governed by the overall intensities of the fields at 
those particular points. (Of course, if the field strength essentially goes to zero 
all over the place, none of the counters will register.) Here we are considering 
correlation, not rates, and there will be no tendency towards correlation, given 
higher order coherence. 

Now let us switch away for a moment and say why it is interesting to define 
such a curious idea. In the Hanbury-Brown-Twiss experiment (Fig. 4) 
light from a discharge tube (a narrow bandwidth spectral source) is passed 
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Fic. 4. Hanbury-Brown-Twiss experiment. 


through a pinhole and emerges as a very coherent beam in the first order sense. 
The beam is divided by a half-silvered mirror and recorded by two photo- 
detectors. In the original experiment these were photomultiplier tubes which 
gave a more-or-less continuous output which was random. The two outputs 
were then multiplied together to see if there was any correlation between them 
and such a correlation was found. In all later versions of this experiment, the 
detectors are photon counters and coincidence detection is used. In the 
original experiment the detectors were moved relative to their zero points and 
in all subsequent versions of the experiment, the counters are kept more or less 
fixed and variable time delay is used in the coincidence gating. In this experi- 
ment the coincidence rate was plotted against time delay (the two counters 
have finite resolving times and so we expect some smearing-out of co- 
incidences). We should expect that if the entry of the photons into the two 
counters were statistically independent events we should see a uniform back- 
ground in the coincidence rate versus the time delay (Fig. 5). In the original 
experiment a very tiny deviation from the uniform coincidence rate versus 
time delay was observed. In fact, however, the effect is not a tiny one. If it is 
possible to get the band width of the incident radiation to be narrow enough, 
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(9) 
Fic. 5. Results of Hanbury-Brown-Twiss experiment. 
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in other words, if the experiment is done in ideal circumstances, we find a 
coincidence rate corresponding to zero time delay which is twice the accidental 
background rate. So there is in the light entering the system, a distinct 
tendency for photons to occur in pairs, to enter the system in pairs, or at least 
for these two photon counters to register photons simultaneously. That is the 
Hanbury-Brown~Twiss effect (HBT effect). 

Why do we see this effect? Is it, for example, some trick in the mirror? No, 
the only function of the mirror is to allow us to do something which would 
otherwise be very inconvenient, which is to put two counters at the same place 
in space. The mirror creates the same field in front of counter 1 as is in front 
of counter 2. We could perform the experiment with a single counter if we had 
a fast enough resolving time. We should investigate the distribution of intervals 
between photon counts and we should discover that there is an anomalously 
large number of short intervals between photon counts. There is again a 
distinct tendency for photons to clump as they enter a given counter. Well, 
here we have two photon counters in the field and they show results which are 
not statistically independent. This is a situation in which G® is definitely not 
equal to GYG, There is no second order coherence. Then we might think 
that there is no sense in defining such a notion. But, in fact, this statement is 
not true for all fields and one of the ideas which led to all the work we have 
been describing is the surmise, very quickly born out by experiment, that in 
fact there is no HBT effect when a stabilized laser beam is used as the source. 
When we plot the coincidence rate, we find no bump at all (Fig. 5), regardless 
of the time delay. No HBT effect occurs in a stablized laser beam. Well, that is 
the sense of defining it. There is now a good deal of verification that the 
factorization holds, certainly up to some quite high order (six) of the correla- 
tion functions. So up to sixth order there is this factorization in the stabilized 
output of a high quality gas laser; that is the rationale in defining these 
correlation functions and discussing higher order coherence. 

Let us now consider a little further the consequences of defining higher order 
coherence. We can prove some mathematical theorems and if we return to the 
papers of Titulaer and Glauber“? one of the interesting things is this. There 
are more consequences of the first-order coherence notion (i.e. maximum 
fringe contrast, or the factorization condition, which are equivalent) than first 
appears. One consequence is that a species of factorization holds in all of the 
higher order correlation functions. They all factorize into approximately the 
form we should like but with a coefficient g, which is generally different from 
unity. 


G(X, eines X on) = GnO*(X1) eee 6*(X,) E(Xn+1) vals &(X2,). (70) 


In higher order coherence the coefficients g, (which can generally be any 
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positive numbers) are all equal to 1, 
g,=1, all xn. 


So we can specify the coherence properties of a beam by considering first the 
factorization of the first order correlation function and then, if it factorizes, by 
discussing the discrete sequence of numbers g, (they are quite measurable 
numbers). A problem in all definitions of coherence is that they can only be 
fulfilled in approximate terms. If we speak of factorization as a coherence 
condition, in actual fact, we should never fulfil those conditions everywhere 
in the universe. But a laser beam can do it over vast intervals of the space 
variable and intervals of the time variable, perhaps as long as 0-1 sec. 

There are a number of other consequences of the assumption of first order 
coherence. Let us consider the coherence conditions in still one more way. One 
of the ways of securing the first order coherent beam is to occupy just one 
mode. If only one mode of the field is occupied then we can always write the 
correlation function in factorized form, so one mode occupation does it. That 
will always allow us to write G“) as a factorized form. We can broaden that 
example greatly if we define modes more generally than we are accustomed to, 
e.g. define a set of mode functions »,(r, t) in terms of our other mode functions 
u,{r), as 

u(r, t) = py Vix u{7) exp (— ia, t). (71) 


The new set of mode functions are non-monochromatic mode functions. It is 
easy to define, as linear combinations of the familiar creation and annihilation 
operations, new creation and annihilation operators for these new non- 
monochromatic modes. These non-monochromatic modes are perhaps also 
describable as wave packets (whether we should call them modes is open to 
question). The most general first order coherent field is one in which only one 
of these generalized modes, or only one of these wave packets has any photons 
in it at all, other varieties are empty. That is a fairly simple characterization of 
first order coherence. 

All coherence, or higher and higher order coherence represents more and 
more constraints upon the distribution of quantum numbers for the occupa- 
tion of that one mode, and, finally, full coherence, implies that there is a 
Poisson distribution for the number of photons. These are the definitions of 
coherence. Again, are there any fields which will fill all of them? There is the 
trivial example of a classical field with no uncertainty in it. That is a classical 
field in which the Fourier coefficients are absolutely fixed. For such a field, all 
of the classical correlation functions factorize in just the way we are describ- 
ing. So now it is obvious that any noiseless field, any perfect signal, is fully 
coherent. What our hierarchy of definitions of coherence has done is to 
approach in the limit, the sort of field which any communication engineer 
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would say is coherent, viz. one which involves no noise, except, finally, the 
trivial sort of noise, which represents a phase arbitrariness in the field &. Any 
overall phase in the expression just cancels out. So much for the subject of 
coherence. 


6. COHERENT STATES 


We shall now consider quantum mechanical fields which have the property of 
full coherence. First we shall consider pure states since these are likely to be 
the simplest. That means that the density operator can be written as a trivial 
dyadic product of this state with itself, 


p=|><l. (72) 


Then our correlation function G” takes the form of an expectation value in 
that state of a normally ordered product of operators with creation operators 
on the left, annihilation operators on the right, 


G = EO. EM], (73) 


What kind of pure state leads to factorization of this expression? If it were 
possible to diagonalize the operators E“) and E“ simultaneously, then such 
an eigenstate of them would fill the bill very nicely, but we cannot do it 
because the operators Et? and E“? do not commute. Suppose we continue 
the restriction of dealing with just those experiments which can be described 
in terms of normally ordered products. As long as the products are normally 
ordered, we do not really need simultaneous eigenstates of E“*) and E@ 
because all we ever find is E> at the right and E at the left so it is quite 
sufficient to find an eigenstate in this sense. The E“*? (r, t) applied to our state 
gives an ordinary function, &@(r, t) multiplied by that state, 


E™(r, t)|> = &(r,1) |. _ (74) 


Of course, if that is true, then there will be a dual relationship which is this: 
the dual state with the Hermitian adjoint operator applied will give the 
complex conjugate eigenvalue &*(r, t) field multiplied by the dual state, 


<| Er, t) = é*(r, t) <I. (75) 


With these two relationships, we have secured, for all normally ordered 
products, an expectation value which factorizes, and therefore G*” will be 
given by 

G™ = E*(r4, ty). E(rams tam) < | >- (76) 


The two relations (74) and (75) together provide a sufficient condition for 
factorization. We have not in this way determined the most general set of pure 
states which does this. That is done in the papers of Titulaer and Glauber.“ 
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The additional states that are found are not very interesting. They differ just 
in having some extra phase factors, which the states we shall discuss do not 
contain. The additional ones are ruled out by considering the more general 
set of correlation functions G”™. If we require that these factorize as well, 
the only pure state that will do it is the state that satisfies the eigenvalue con- 
dition. The eigenstates of E‘*(r, t) are the coherent states. It is obvious that 
the states of (74) are not states with a fixed number of quanta in them because 
E‘*) is an annihilation operator and the statement here is that we are applying 
an annihilation operator taking away one photon and, even so, we have the 
same state left. That is not possible if we have a fixed number of photons, so 
the number of photons must be indefinite. Now to see what the eigenstate 
condition (74) means in greater detail let us expand the positive frequency 
field operator in terms of normal modes as we did in (15) 


EO =i (ha,/2)* a,u,(r) exp (—ia, t). (77) 
A corresponding expansion must be available for the eigenvalue function 
& =i > (he,/2)* %,u,(r) exp (—ia, t). (78) 


Since the mode functions u,(r) form an orthogonal set the eigenstate condition 
(74) implies that 
a,| > = %| > (79) 


for all values of the mode index k. Since it is possible to project out of the 
dependence on the spatial coordinate, r, all the various coefficients associated 
with the different mode functions, then it must obviously be true that a, 
applied to this unknown state gives us the number «, applied to the unknown 
state. In other words, these states are the right eigenstates of all of the annihi- 
lation operators and now we know the properties of those states. Equation 
(79) alone is sufficient to determine them for us. For one mode, it follows from 
this equation, that the state can be expanded in the following way in terms of 
the m quantum states, 


oa" 

n 80 
Tai ™ (80) 
where the normalization factor is exp (—4|«|*). Then we label this one mode 
state with the complex eigenvalue itself. 

Now, obviously there is such a state for any one mode for any complex 
number « that we choose. For the full set of modes we can write 


{o}> = Il lot. >a (81) 


We secure all possible classical solutions to the Maxwell equations by sub- 
stituting in (80) all possible complex coefficients «, and, therefore, we see that 


le> = exp (—4|a]7)¥ 
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there is a one-to-one correspondence between eigenstates of (74) (i.e. eigen- 
states of the positive frequency part of the field) and classical solutions to the 
Maxwell equations. In this very limited sense there is a one-to-one correspon- 
dence between classical electromagnetic theory and quantum theory. The 
coherent states (which have now to be characterized by the entire set {a,}) are 
in One-to-one correspondence with the classical positive frequency solutions 
to the Maxwell equations. Of course, the way we construct a state of the field 
for all possible modes is to take the outer product, the direct product (which 
is what is meant by the product symbol IJ,) taken over all modes of the 
coherent states for each mode. Those are the coherent states, and some of their 
properties have been discussed in the chapter by Kibble. 

The question now is, “Can we do something more with coherent states than 
simply say that this is a set of states which obeys our coherence conditions?” 
If that were all we could do with them, they would belong in a museum some- 
where. The point is that they are very useful in doing calculations. First of all, 
it is useful to realize that any state of the field can be expressed in terms of the 
coherent states. Because there is such a vast number of coherent states, a two- 
dimensional continuum of them, and because they happen not to form an 
orthogonal set, there is a variety of ways in which we can expand any state, 
in terms of the coherent states. However, we are not going to expand states in 
terms of coherent states, we are not going to expand arbitrary state vectors, 
we are going to be concerned with operators (though the idea is basically the 
same). Suppose we want to expand any state in terms of the coherent states. 
We multiply by the unit operator which is 1/z times the dyadic product inte- 
grated over the entire complex plane, thus 


12> == | lap dal dal2», (82) 


where d?a means the differential of the real part of « times the differential of 
the imaginary part of «.t Here we have explicitly an expansion of our unknown 
state in terms of coherent states. We can do that in general and we can expand 
operators in terms of the coherent states as well. The virtue of doing that, in 
particular for the density operator, is this. Once we have the density operator 
expressed in terms of the coherent states, then since everything we want to 
evaluate is in normally ordered form (at least for the moment) all of those 
operators applied to the coherent states on either side of them are turned into 
numbers and the operator aspect of the problem disappears. We then deal just 


t12>=2 | fap<alr aa 


<a]? = exp (—4|a]?)f(a*) where f («*) is an entire function of «* 
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with complex ordinary numbers. To expand the density operator, p, we 
multiply it by 1 on each side. Having used it twice, we have to integrate over 
two complex variables, 


p= <a| In) <olplb> <Bla?aa?B. (83) 


Again, if we know the operator p, we know the function which stands in the 
middle of this expression. More generally for many modes, we have to expand 
in terms of a whole set of «-coefficients and a whole set of f-coefficients, as 
follows: 


p= =a | Kou} <Caol(6}> C63 T] daa, (84) 


We do the same thing for every mode in the field. There are quite a few modes 
and quite a few integrals have to be done. 

The point of this is that the expectation value of the field operators in a state 
like this (the matrix element between these states) reduces to a product of 
c-numbers. 


{Bi} EO... E | {on} > = E*({B,}) ME (fa {Bibl{on}> (85) 


All of the annihilation operators are applying to the right on a coherent state, 
so all of these annihilation operators become numbers, in other words this 
becomes just the product of eigenvalues &. All of the creation operators are 
applying to the left on a coherent state, these turn into a product of complex 
conjugate eigenvalues. Then we have the integrations left which are implicit in 
this expression. The process of evaluating an expectation value becomes a 
process of integration over two complex variables for each mode. This is the 
most direct and the most obvious way of using the coherent states but there 
are other ways. The advantage of this particular procedure is that the sorts of 
functions that we deal with are always terribly well behaved. It is always true 
that the scalar product of the state « with anything can be expressed as 
exp (—4|a|”) times some function of the complex variable «*, but it has to be 
an entire function. (An entire function is about the best behaved sort of 
function we can have — it has no singularities anywhere in the finite plane.) 
So those are non-singular functions, and, equivalently it is true in this repre- 
sentation that the weight function for the density operator, when multiplied by 
exp (4|a|7) exp (4/8|7) is likewise an entire function of a«* and B, again a very 
well behaved sort of function. We have that for the cost of doing two integrals 
for each mode. 
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There are other ways of proceeding. Let us first see something of the 
ambiguity in the use of the coherent states as an expansion basis. We can take 
the operator I and expand it in two ways in the coherent states, one way is this, 


l=5 i la) <al 1 |B) <Bld2a 2p 


1 
= =| => <P exparp Hal? pl) da dp, (86)} 


This is a representation of unity as a double integral in terms of the coherent 
state. But we know another representation, which is this, 


l= =f la> <ajd7a. (87) 


If we do the f integration which is easy enough to do in this case, the first 
form of the representation degenerates into the second one. We can derive 
the second one from the first one. There are these two different ways of expand- 
ing this sample operator, unity, in terms of the dyadic product with different 
coherent state arguments and in terms of those with the same arguments, 
which is to say projection operators on the coherent states. These two varieties 
of expansion can be available for operators of all sorts and we shall next 
discuss the availability of the second kind of expansion for the density opera- 
tor. We expand the density operator in the following way in terms of projec- 
tion operators on the coherent states, 


p= | Pa)|a> <al da, (88) 


where P(«) is a weight function which must be a real valued function of « in 
order to keep p Hermitian (as it must be). Is such a representation available? 
Well, thereby hangs a tale. Of course, if it does exist, then, since the trace of 
p is | and the trace of the dyadic product of the « state with itself is just unity, 
the function P(«) must be normalized when integrated over the entire complex 
plane, 

Trp=1= | P(a) da. (89) 


If this representation exists, P(«) has some of the properties of a probability 
distribution. In fact, it is not one, and we shall see why below. But, is there such 
a representation as this and can we use it; and can we manage to survive what 
happens in the literature if we do? 


tT <a|1]B> = <a> 
<alB) = exp (a*B—Slal*—416)*) 
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The point of expanding in terms of coherent states is that it simplifies the 
evaluation of statistical averages (expectation values). The expansion in terms 
of coherent states can be carried out in two ways and we illustrated both of 
those above. We recall that for the unit operator there are two species of 
expansion. The one is an expansion in terms of pairs of different coherent 
states and the other is an expansion in terms of pairs of identical coherent 
states. Now this second form of expansion is the simpler one since it involves 
only one complex variable for each harmonic oscillator in the field (only one 
complex amplitude for each of them). Let us talk therefore about that 
particular representation, which, wanting a better name, we call the ‘P- 
representation’, 


an { P(a) Ja) Cal d2a. (90) 


The function P should be real valued in order to keep p a Hermitian operator. 
There is a normalization condition on the density operator which is that the 
sum of its eigenvalues (which, as you know, represent probabilities) must be 
unity. The trace of p, therefore, is 1; that means in this representation that the 
integral of the function P taken over the entire complex plane has to be unity, 


i P(a) da = 1. (91) 


We have left completely open the question as to whether it is possible, in 
general, to construct such a representation. For some states obviously we can 
do it. For example, for a pure coherent state, the one based on the amplitude f, 


p = |B> <Bl. (92) 
The way in which we do it is simply to choose for the function P, a 6-function, 
P(a) = 5(a—), (93) 


where 56(y) is a two dimensional 5-function. A 2-dimensional 6-function is 
the product of two one-dimensional 6-functions, 


6 (y) = 6(Rey) d(Im y), (94) 


and the arguments of those functions are real, i.e. one is the real part of the 
argument, the other is the imaginary part of the argument; both of those are 
real numbers. The great advantage of this representation, where we can use 
it, is the following. If we have a normally ordered product of operators and 
want its statistical average, then we substitute for p the integral expression 
given in (90) for the density operator and the trace is then the average 
expectation value in a coherent state of the normally ordered product, 


Tr(pat"a") = | P(a) Cal at*a™ |x) d2a, (95) 


QUANTUM THEORY OF COHERENCE 93 


But now the operator a” operates on a coherent state to the right and a‘ on 
a coherent state to the left; each of those operators then turns into its eigen- 
value and what we are averaging is just a pair of complex numbers, 


Tr(pat"a™) = | P@arram da. (96) 


The averaging of operators turns into the averaging of numbers, but not only 
that, the averaging procedure looks a great deal like statistical averaging of 
the usual sort in probability theory. 

There are a number of things to observe about this representation when we 
can use it. The temptation is very great to think of the function P as a proba- 
bility density. All of the properties that we have indicated here are properties 
of a probability density. For example, when we have a perfectly definite state 
' represented by a 6-function which is a non-negative function, or can be chosen 
as such, the averaging procedure we use is exactly what we would use in 
probabilistic terms. The only trouble is that this function P is not a probability 
density. It is not for many reasons which are all more or less related. The 
states that we label with the parameter a (the coherent states) are not an 
orthogonal set. So, for example, the projections in the directions of these 
coherent states in our entire space of states are not orthogonal projections. A 
harmonic oscillator can be in two different coherent states at once. We are 
not, therefore, summing mutually exclusive possibilities in summing these 
projection operators, so at least in that sense, P is not a probability density. 
It is not measurable experimentally as a probability density either, because « 
is an eigenvalue of a non-Hermitian operator, a, which has real and imaginary 
parts, or Hermitian and skew-Hermitian parts which do not commute and 
which are not, therefore, simultaneously measurable. They correspond to the 
position and momentum respectively of the oscillator and we cannot carry 
out a measurement which tells us anything definite about both position and 
momentum simultaneously. So there are no measurements we could make to 
determine a probability distribution (density) which would correspond to the 
function P. We have to think of the function P simply as a weight function in 
an expansion of this sort. Asymptotically, in the classical limit, it becomes a 
probability density. We shall see below a little more about what the classical 
limit is. In general, quantum mechanically, we have to face the possibility 
that the function P can take on negative values. (It is easy to find examples for 
which it does.) If P does happen to be a non-negative function it is easy to 
show that that is consistent with the positive definite character which the 
operator p must have, namely p as an operator can never have any negative 
eigenvalues. Indeed, positive functions P lead to that condition, but the con- 
verse is not true. Positive, definite and perfectly well behaved density operators 
do lead to functions P which freely take on negative values and there is in 
general, no avoiding that. 
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The function P is analogous to the probability density; asymptotically, in 
the classical limit, it is the probability density and it is probably best to give it 
the name, a name which was first used for another function, the name of quasi- 
probability density. The first such function which was known was that intro- 
duced by Wigner.“ Wigner’s interest was in finding the quantum mechanical 
analogue of the phase space density function in classical statistical mechanics. 
In classical statistical mechanics we are accustomed to talking about systems 
in terms of a density function in the space whose co-ordinates are position 
and momentum variables simultaneously. Now there is obviously something 
peculiar about doing that in quantum mechanics. It is only in a very limited 
sense that we can discuss such a function. It is obvious that no such thing can 
have the interpretation of a probability density, in general. Well, Wigner found 
such a function. It plays, as we shall see, for certain orderings of operators, a 
role analogous to that of the probability density. The particular operator 
orderings involved are ones which are symmetrical in the p and q variables. 
That happens to be the same statement as the statement that the orderings are 
symmetrical in the a and a‘ variables. The Wigner function has a particular 
range of uses. It is most useful, naturally, when we are talking about sym- 
metrically ordered variables but they are not what comes out as a rule in des- 
cribing the simplest experiments in quantum optics. Well, our function P is 
not the Wigner function, it is another quasi-probability density, but, as we 
shall see below, a fairly close relative of the Wigner function. 

Now, let us see what happens when we deal with a slightly more compli- 
cated situation in which many modes are excited. Then, of course, we have to 
deal with a many variable function P ({«,}). We deal with a function in which 
one amplitude is specified for every mode of the field. If we just define a symbol 
now which is the field eigenvalue corresponding to a particular coherent state, 
we see, if the coherent state is the one labelled with all of the amplitudes «,, 
we can write the eigenvalue of the positive frequency part of the fields as & 
and it will be an expression which is linear in the amplitudes «,, 


EM (r,t) | {ok > = Cr, t{on}) | {on}. (97) 


The expression is a linear combination, an infinite sum, a sum over all the 
modes of terms which are linear in the amplitudes «,. Now suppose we are 
told that such a P-representation exists for the density operator of the entire 
field, how do we calculate the correlation functions? Let us illustrate this for 
the first order correlation function. The story, of course, is the same for all of 
them. The first order correlation function is the statistical average of the 
normally ordered product of the field operators, and because it is a normally 
ordered product, we can just replace the field operators by their eigenvalues. 
The first order correlation function expressed as an integral taken over all of 
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the complex mode amplitudes «, is given by 
G(X, X,) = | P({cu,}) &*(X , {0y}) 8(Xp{oq,}) TH d04,. (98) 


This is an exact quantum mechanical expression, and yet we see here some- 
thing which looks a great deal like the classical procedure which we encount- 
ered in eqn. (12) for evaluating the classical first order correlation function. 
The correspondence is as follows: 


E(X{a,}) <> EGD, (X{G}) (99) 
P({a,}) <> P({,}) (100) 
i(ha,/2)?a, — G;, (101) 


The field eigenvalue & corresponds simply to the positive frequency part of the 
classical field. The weight function in our representation of the density opera- _ 
tor, P(«), corresponds to the probability density for the random Fourier co- 
efficients @,. Finally, the @’s themselves correspond to particular given con- 
stants multiplying the amplitudes a,. When we make these identifications, 
there is a precise correspondence between the formulae for the classical cor- 
relation function and the quantum mechanical one. Presumably, we use the 
classical procedure when the numbers are meant to describe strong fields. But, 
of course, if we are dealing with really strong fields, the number @, is not an 
infinitesimal amplitude, it is substantial in typical classical units. Then since 
Planck’s constant is so terribly small, that is going to mean that |c| is going to 
have to be very large, and so the classical limit, strictly speaking is the limit in 
which the dimensionless excitation moduli, |x], become very much larger than 
unity. 

An illustration of the transition from the quantum to the classical limit may 
be seen in the analysis of the parametric amplifier by Mollow and Glauber. 
The dynamical behaviour of an amplifier is discussed. Initially any field is put 
into the amplifier—any field at all. Now, in general, for arbitrary quantum 
fields, the function P can take on negative values of course, but worse still, it 
can be quite singular; we see one example of it in the case of the coherent 
state itself in which P is a 6-function. But, as we shall see in a little more detail 
below, the function P can be even more singular than a 6-function, and in fact 
it can be so singular that there is a serious question whether it exists at all. We 
put such fields (for example for arbitrarily singular functions P) into our model 
amplifier and let the amplifier amplify and develop the function P as a function 
of time. The density operator of the field in the amplifier of course keeps 
changing as the amplifier amplifies it and what we discover is that, physically, 
the amplifier turns this, perhaps poorly behaved function P at time zero, into 
an exceedingly smooth and well-behaved function at large time. If P took on 
negative values at time zero, it turns out there is a time beyond which it can 
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only take on positive values. The function P depends, in other words, on the 
initial field. The field in the amplifier is described by a function P which 
eventually becomes very smoothly behaved over distances in the complex 
plane of the order of one unit and takes on all of the properties of a probability 
density in the classical limit in which it really becomes one. 

We have so far provided a precise, non-trivial justification for the procedure 
which has always been used in optical calculations, where it has been used 
correctly. The grand tradition in doing optical calculations in determining the 
intensities of interference patterns and the like, has been to assume that a 
classical field is incident on the system, to calculate the field amplitude where 
it is being observed and then to average that field amplitude over some sort of 
distribution. Typically, we do not deal with the absolute intensity anyway; we 
normalize it relative to the input intensity. In that case, we are carrying out 
exactly this averaging procedure and we have shown that for the class of 
experiments in which we are detecting what amounts to the averages of 
normally ordered operators, that is a precisely correct procedure to carry out. 
The general characterisation of such experiments is that they are the ones 
which give null results in the vacuum. It is not really necessary to call that the 
semi-classical method, or quasi-classical, or anything of the sort, it is precisely 
quantum mechanical provided we know that the representation of the field we 
are dealing with exists in this particular form in terms of the coherent states 
(P-representation). Now, of course, if we are doing a different experiment, 
which is based upon a different ordering of the operators, then the story is 
different. There is no statement about the strength of the field required in 
order to use this particular representation, it holds for arbitrarily weak fields. 
In that sense, it is a completely quantum mechanical result—we simply should 
not describe it as classical in any sense at all. 

Now we have to face the question ‘‘Can we find such a representation?” and, 
“How, in particular, do we discover what the function P is for any given field?” 
There is a number of such questions which we can gather together and discuss 
as a unit. First let us note that there is a whole class, a whole family of 
functions which we ought to identify as quasi-probability densities. One of 
them is the function P, above, another one is the Wigner function which is 
now some 38-39-years-old, and still another is a function which we can form 
very easily by taking the diagonal matrix element of the density operator in a 
pure coherent state; that, obviously, is something which is non-negative, it is 
greater than or equal to zero, because the density operator is a positive opera- 
tor, 


P(a), Wa), <alpla> > 0. (102) 


The functions (102) are quasi-probability densities because again there is no 
way in which we could measure them having measured the real and imaginary 
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parts of the co-ordinate a and observing the frequencies with which they occur 
(notwithstanding the fact, in other words, that <a|p|a> is a positive function, 
it is still a quasi-probability density). These are three particular examples of a 
more general family of quasi-probability densities, W(«, s), distinguished by 
the parameter s. This general family of quasi-probability densities is described 
in detail by Cahill and Glauber.“ There are many mathematical theorems 
we can prove about them and we shall only discuss one or two of them. 

One theorem which is very convenient to know about, if we want to order 
creation and annihilation operators, is the theorem for multiplying together 
two exponential functions of operators which have a rather simple commuta- 
tion relation. We assume that their commutator commutes with each of them. 
Obviously this is true for creation and annihilation operators because the 
commutator is unity or zero. The product of two such exponentials is not 
simply exp (A+B) because A and B do not commute but all the correction 
we need is to add in the exponent the quantity 4[A, B], which can be treated 
as a c-number since it commutes with both A and B, 


exp A exp B = exp (A+B+3{A, B)). (103) 
This holds for all operators A and B such that 
[ALA, BJ] = [B[A, B]] = 0. (104) 


The simplest way to prove this theorem is just to put a parameter, say A, in 
the exponent, differentiate with respect to the parameter and then observe that 
the differential equation obtained may be immediately integrated to give the 
result. The proof is in Messiah’s book on quantum mechanics.‘” 

Let us now define a unitary displacement operator D(A). There is in the 
exponent a skew-Hermitian operator. This is an operator analogue of, for 
example, exp [i(real number)]. It is a unitary operator in the sense that 
D~* = D(-A) = D1 (A). Now, when we consider ordinary probability densi- 
ties, it is useful, frequently to define the Fourier transform of the probability 
density, There are a great many calculations done in probability theory which 
are done more easily that way; it is the so-called characteristic function. 
However, in considering a density operator in the present case we have to find 
a somewhat different analogue for the construction of the characteristic 
function. We shall still call what we construct a “characteristic function” but 
we shall take it to be formed in this particular way, 


X(A) = Tr{pD(A)} = Tr{p exp (Aat —A*a)} (105) 


where the Fourier transform of the probability density is really the expectation 
value of the Fourier exponential function. Here we simply take that expecta- 
tion value by taking the trace of the density operator times the unitary opera- 
tor D. 
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It is implicit in this exponential function that operators a and a‘ are treated 
in similar ways and if we were to expand this exponential function of the 
operators we would find that the operators are ordered in all possible ways in 
each term of the expansion. This, in other words, is a symmetrically ordered 
expression when we expand it. 

Now, there are other characteristic functions that we can define in which 
the operators would not be ordered exactly symmetrically. For example, we 
can define another version of the characteristic function in which we separate 
the exponential functions of the two operators (that is of course simply a dif- 
ferent function). Here the operator content of the expectation value is written 
in normally ordered form: 


Xy(A) = Tr{p exp (Aa‘) exp (—A*a)}. (106) 


Of course, that has to be related to the characteristic function defined by (105) 
and, using the multiplication theorem for exponentials, we obtain the latter 
with the correction factor exp (4]A|), 


Xy(4) = Tr{p exp (dat —A*a)} exp (41A|?). (107) 


We could equally well define a characteristic function in which the operators 
are in antinormal form and the correction factor is exp (—4A|?), 


X,(A) = Tr{p exp (—A*a) exp (Aat)} (108) 
= Tr{p exp (Aat —A*a)} exp (—4]A|?). (109) 


We are considering here two examples of something which is obviously more 
general. 

We can define a characteristic function which depends not only on the 
arbitrary parameter 2 but on a new variable s; we shall define it with a cor- 
rection factor exp (45|A|”) thus, 


X(A, s) = X(A) exp (4s|A|?). (110) 


Then, having done that, we can identify the three characteristic functions 
which correspond to different values of the parameter s. When s = 1, we have 
the normally ordered one, when s = 1 we have the anti-normally ordered one, 
when s = 0 we just have the one defined originally, 


X(A,1) = Xy(A), 


X(4,0) = X(), (111) 
X(A, -1) = X, (A). 
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We can use the exponential function to define s-ordered products of 
operators, 


exp (lat —A*a+4s|d|*) = ¥ Seal B fal"a™}.. (112) 


This is very convenient for the purposes of dealing with all orderings at once. 
Suppose we write the exponential operator and add a correction factor. When 
Ss = 1 we know that this exponential is identically equal to the normally 
ordered exponential. When s = — 1 we know that this exponential is identical- 
ly equal to the anti-normally ordered exponential function. When s = 0 it is 
obviously the symmetrical one. Those three cases, in other words, are included 
and then, of course, by letting the parameter s take on other values, we include 
many other ‘“‘orderings”. In a formal sense they are orderings of operators but 
I do not know how we would write them if, for example s = ./1/z; it is simply 
a formal notion, but it does allow us to deal with the above three orderings 
together, and to see something which we would not otherwise see of the 
relationship among the functions (the auxiliary functions) for each variety of 
operator ordering. The right-hand side of (112) is obtained by expanding the 
exponential function. We can use the terms of the power series expansion to 
define s-ordered operators and then for s = 1 we are dealing with normally 
ordered combinations (normally ordered products), for s = — 1 we are dealing 
with anti-normally ordered ones, and for s = 0, this is a definition of sym- 
metrically ordered ones (and what we mean by symmetrically ordered there, is 
really taking the arithmetic mean of all the possible ways of arranging these 
particular factors in each term of the expansion). 

Now we can see the following things. If we want to evaluate the statistical 
average of such an s-ordered product, we can do that by going back to the 
appropriate characteristic function, differentiating with respect to the para- 
meter A, or 1* an appropriate number of times, and then setting 4 = 0, 

a” o” 

on” O(—A*)" 
This s-ordered characteristic function we have defined is really a generating 
function for the statistical averages of the arbitrarily ordered products of 
operators. If we know the characteristic function, then we find the averages 
just by evaluating all of its derivatives at the origin, atA = 0. 

Now let us look at the case which interests us most, the case of normal 
order, the normally ordered characteristic function, for which s = 1. Let us 
assume that P-representation exists and, in that case, we evaluate that charac- 
teristic function by just taking the statistical average of the normally ordered 
exponential function, and that is a trivial thing to do. Since the exponential is 
normally ordered, each of the operators in it simply becomes an ordinary 


X(A, 8) [Jee Tr[p{at"a™}.]. (113) 
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number multiplying a coherent state; the scalar product of the two coherent 
states is unity and they go away and we are left then with this explicit evalua- 
tion of the normally ordered characteristic function, 


X(A, 1) = Xy(A) = | P(«) <a| exp (dat) exp (—A*a)|a> da 


= | exp (Aa* —A*a) P(a) d?a. (114) 


The exponential function here has an imaginary argument; it is a function of 
the two complex variables 4 and « and from an algebraic standpoint, it is 
simplest at this stage to use Cartesian coordinates. When we do that we see 
that we have a Fourier transform in two variables of the function P. The 
Fourier transform is, presumably, invertible (that is a bit of a question here, 
we will have to come back to it). Let us assume that the Fourier transform is 
invertible. A convenient way in which to write the Fourier integral here uses 
the Dirac—Fourier integral representation of the 6-function written out in two 
dimensions and in complex notation. This particular form of integral 
representation is, 


5 [ exp [A(a* — B*) —A*(x— B)] d?A = 5 (a). (115) 


By multiplying eqn. (114) by an exponential function of 4 of the appropriate 
form and integrating, we, in effect, project out the function P, 


P(a) = - | Xy(A) exp (aA*— oA) d22. (116) 


The function P is thus the corresponding Fourier integral of the normally 
ordered characteristic function, providing we can invert the Fourier transform. 

The problem in inverting the Fourier transform is that for some states (and 
these are perfectly reasonable ones,) the normally ordered characteristic func- 
tion, though it is a continuous and very well-behaved function, becomes 
extremely large as the modulus of A goes to infinity. The only general bound 
that we can put on the normally ordered characteristic function comes from 
this consideration: The ordinary characteristic function, the one which is the 
expectation value of the unitary operator, has to be bounded by unity, simply 
because it is the average of a unitary operator, 


[X(A)| <1. (117) 


However, since the normally ordered characteristic function is equal to the 
ordinary one times a Gaussian factor, then we see that the normally ordered 
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characteristic function is only bounded by a function which blows up as A 
goes to infinity, 


[Xy(A)] < exp (41A]?). (118) 


Now, we can indeed find quantum states, which at least come close to meeting 
this bound as A - oo and then we are in a terrible position; if we insist upon 
inverting the Fourier transform (114), we have to calculate the Fourier integral 
of a function which blows up as exp (+7), as it were. We can find examples, 
and many of them, for which that happens in at least one of the two variables 
of integration. In other words, there are cases in which it does not seem 
possible to define the function P in any familiar sense in Fourier transform 
theory. Certainly not in the sense of square integrable functions, and in fact 
not more generally in the much broader context of tempered distributions 
which is discussed these days in connection with Fourier transform theory. 

Now, we can take either of two attitudes towards this problem which seem 
to have evolved in the literature. One attitude is that when we are dealing with 
quantum states for which the function becomes so enormously singular, there 
is really not much convenience left in defining the P-representation or attempt- 
ing to use it, and there is certainly very little sense in saying that we have any 
remaining resemblance between the quantum mechanical and classical 
pictures. A different attitude is that we can define, perhaps not a P-representa- 
tion in the sense that we have been describing, but we can, for example, do 
calculations by using limiting procedures by representing the density operator 
as a limit of an approximating sequence of P-representations. That has been 
the attitude of Klauder and Sudarshan.“ Their coworkers have gone on to 
identify integrals of this sort, not with distributions or tempered distributions 
in the ordinary sense, but with things which are now being called ultra- 
distributions. They are still a vastly more general notion of a generalized 
function. This is rather deep mathematical water because once we have defined 
integrals of this sort, it is not at all clear what we do with them. Thus far, 
nothing more than the definition seems to exist. 

There does not seem to be a clear or simple answer to the question of 
whether the P-representation can be constructed in general. It depends, at 
least in part, on definition. Now, for the other quasi-probability densities, the 
situation tends to be rather simpler and we can deal with them all at once in 
the following way. 

We can define a function W(«,s), where s is the ordering parameter, and 
take that in each case to be the Fourier transform of the appropriately ordered 
characteristic function, 

d7A 
W(a, 8) = | X(A, 3) exp (aA*—a*2) —, (119) 
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Then the formula (114) is precisely this for the case s = 1 except that we have 
put a factor of 7 into the formula for W, 


W(a, 1) = mP(a). 


We can invert the transform (119), in general, (or let us assume that we can); 
then the ordered characteristic function is written as a Fourier integral 


XGa= | exp (da* —A*a) W(a, 5) a (120) 


Suppose we differentiate the inverted transform (120) in the neighbourhood of 
the origin. We have already shown that is the statistical average of the 
s-ordered product of operators (113). Now it is obvious that if we differentiate 
the exponential in (120) and then set A = 0, we are finding the moments of the 
function W, 

o” o™ 

an" a(—A*)" 

Now for all of these orderings, for the three familiar orderings and for all of 
those that we define formally for all complex values of s, the following is true. 
The statistical average of the ordered product of operators is given by the 
ordinary c-number moments, the classical moments, so-to-speak, of the 


function W. In particular, for n = m = 0 we have just the normalization 
relation, 


2 
X(A, s) ig Tr[p{at"a™},] = [ amen, s) = (121) 


2 
[ Wa, yt = (122) 


because, of course, the characteristic function at argument zero is just the trace 
of the density operator, that is, just unity. So (122) is a normalization condi- 
tion for the entire family of quasi-probability distributions. For s = 0, we have 
precisely the function W which was defined by Wigner in terms of real vari- 
ables, not in terms of complex amplitudes, many years ago. For s = —1 we 
have anti-normal order (X(A,—1) is the anti-normal characteristic function), 
that is, just the average of the pair of exponential functions which are written 
in anti-normal order 


X(A, —1) = X4(A) = Tr {p exp (—A*a) exp (Aat)} 
= Tr {2 exp (—A*a) = fia <a.| da exp (2a). (123) 


Here we have written that same statistical average on the line beneath but we 
have written the operator 1 in between the two exponential functions as an 
expansion in terms of the coherent states. Now we evaluate the trace. To 


QUANTUM THEORY OF COHERENCE 103 


evaluate the trace, of course, we push the integral sign to one side and the 
differential to the other side with (1/7). To evaluate the trace we just take the 
expectation value of the expression inside the brackets due to the coherent 
state. But exp (Aa‘) is applied from the right to the left on a coherent state, the 
exponential function exp (—A*a) is applied to a coherent state, so those turn 
into ordinary numbers and what remains finally is just an integral, in fact a 
Fourier integral of the diagonal matrix element of the density operator p, 


X(4, -1) = | exp (Aa* —A*a) Cal pla) a (124) 


So, in other words, the anti-normally ordered characteristic function is just 
the Fourier transform of the diagonal matrix element of p. 

Now we defined the function W for the general case of arbitrary s as the 
Fourier transform of X, and X is consequently the Fourier transform of W. 
Well, we can conclude from this, that W(a, — 1) is just the diagonal matrix 
element of the density operator in a coherent state, 


W(a, —1) = <alpla>. (125) 


There are many properties of the Wigner function that we easily prove this 
way but we are not too interested in symmetrical ordering so let us pass over 
them. We have given three identifications of values of the function W. The 
first is that W(a, +1) is xP, the second is that W(a, 0) is just the Wigner 
function, and finally, W(a, —1) is simply the diagonal element of the density 
operator in a coherent state. 

An example of the sort of relationship between these functions may be 
proved easily in the following way. The ordered characteristic function we 
have defined is just the ordinary one times exp (45|A|). So the relationship 
between two of these ordered characteristic functions is obviously given by 


X(A, s) = X(A) exp [(4s]A|7)], 

X(A, 5) = X(A, t) exp [4(s—r) {A[?]. (126) 
The quasi-probability functions W are the Fourier transforms of the charac- 
teristic functions. We take the Fourier transform of both sides of (126). On 
the right we must have a convolution integral, and the kernel of that con- 
volution integral is just the Fourier transform of this particular Gaussian 
function. That means that we have the following general relation among these 
quasi-probability densities, 


2 
Was) = —— [ expt—2la—-srie—o] wep, SF, 


for Ret > Res. (127) 
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The W(«,s) are all related to one another, they are all Gaussian convolu- 
tion integrals of one another. For example, if ¢ = 1 and s = —1, the diagonal 
matrix element of the density operator is just this particular integral, 


<alplay = | exp (—la— BI?) P(B) d°B. (128) 


This is a trivial relationship, it is one which we see immediately by just taking 
the expectation value of the P-representation. 


W (ce,s) 


s=1 


For pure coherent state 
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There is a large family of such relationships. What happens, in fact is this. 
As we go along the succession of orderings from s = 1 to s= —1, we find 
smoother and smoother functions. A state, let us say a pure coherent state, 
for which we have a delta function corresponding to normal ordering, cor- 
responds to a Gaussian function for s = —1, see Figs. 6 and 7. There is a 
tendency always to smooth the functions as s decreases. For a more general 
state, the function P might be something which wiggles a great deal. Figure 8 
is an example with a good deal of wiggle in it, including negative values. As 
we go towards the other end of our spectrum of quasi-probability functions, 
and go eventually toward those which deal with anti-normally ordered 
operators we tend to find smoother versions of the functions, for example, we 


Wie,s) 


s=-1 


For pure coherent state 
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discover that the Gaussian function which carries out the smoothing process 
wipes out the negative values entirely and we have a function which is never 
negative as shown in Fig. 9. 


Fic. 8 


There are many games we can play with these functions; we can answer a 
great many questions about how explicitly to order operators. For example, 
we might imagine that, given any particular operator, then it can easily be 
cast into anti-normally ordered form, or normally ordered form. If we are 
dealing with a polynomial in the creation and annihilation operators, it is 
obvious that that can be done by just using the commutation relations. But 
it is much less obvious whether those two operations can be carried out when 
we are dealing with infinite series in the creation and annihilation operators. 
In this case, the peculiar result that emerges is this: A very broad class of 
operators can, indeed, be expanded in terms of normally ordered products, 
but it is only a very much smaller class of operators, which can be expanded 
in terms of anti-normally ordered products of creation and annihilation 
operators. That is simply a mathematical result. It is one which again stands 
in the way of the general construction of the P-representation. One of the 
advantages of this generalized treatment of orderings is that we have the para- 
meter s with which we can attempt to define the P-representation in singular 


W(cz,s} 
s=-1 


For general state 
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examples. It turns out that that procedure actually fails, that there is no means 
by which we can define the P-representation in general, for all possible states. 
At least, there is no method within this formalism which permits us to do this. 

These questions of operator ordering, expansions, and the like are not 
physics. They are, at best, a matter of mathematical convenience, and at worst, 
a kind of preoccupation which consumes us as we discover more and more 
interesting theorems. Let us get back to physics. 

Unhappily, the most typical state of the world is not a pure coherent state. 
We have lived for a long time without having to pay any attention to coherent 
states, physically. The most typical state of the world, the one which seems to 
prevail in the overwhelming proportion of cases, is the state of chaos. To 
define chaos we define the entropy of the field. The entropy of any chaotic 
system has this familiar definition which goes back to von Neumann, 


S = —Tr (p log Pax . (129) 


There are some constraints in maximizing the entropy. Obviously, the trace of 
the density operators has to remain 1. Suppose we fix the mean number of 
quanta present in the field, we then have 


Tr pata = <n). (130) 


The problem then is to maximize (129) with the constraint of (130). In saying 
that the typical state of the world is chaos, all we are saying is what any statis- 
tical mechanician would say (they are all cynics). We have to quote some good 
reason to have the world in a state other than one of chaos. There are quite 
frequently good reasons. But if someone gives us a light source and asks for 
our best guess, of its state, saying that it has such and such an intensity, then 
the very best guess, in the absence of information is the one which maximizes 
entropy. That is the statement of information theory. It is very easy to carry 
out the mathematical operation using Lagrange multipliers to maximize the 
entropy and the density operator that we get for a single mode is this: 


Trop [Too] 


That may not look like a familiar density operator, but in fact we are quite 
familiar with it for the case of thermal excitation, which, by the way, is only 
one particular example of chaos. (It is really making a mistake to say that all 
chaos is thermal, the radiation which emerges from an electric discharge tube 
at an energy of a few electron volts is certainly chaotic in character, but it is 
by no means thermal.) For the thermal case 


1 
~ exp[(ha/kT)—1]° 


p= (131) 


«n> 
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Then (131), in the thermal case becomes the Planck density operator. 

It is easy to evaluate the functions (102). To obtain the diagonal matrix 
element of the density operator, we just sandwich p between two coherent 
states, 


ee hi (<<. «n> \i 
<alpla> = <a Tray oP (ES (GF). (132) 
We can easily evaluate the infinite sum and get 
Sec igi = = 
<alpla> = Tea [—lal*/(+<n>)] = W(@,—1). (132) 


Taking the Fourier transform, we obtain the characteristic function for 
s= —l, 


X(A, —1) = exp[—(1+4+ <n) JA|?]. (134) 


If we then multiply by the exponential function which gives us the general 
form of the characteristic function for all values of the ordering parameter, 
we have the result: 


= 
X(A, s) = exp |- (“5° + <n] uP] ; (135) 
Now, if we go back and take the Fourier transform once more we have the 
result finally, for the whole family of quasi-probability distributions, 


. 2 —2|c|? 
UGS Scaggs te P(sani). ee) 


In particular, for s = 1, we have the simple Gaussian function, 


P(a) = — W(a, 1) = + exp (— al?) <n)). (137) 


Ae 2 

Nothing could be more obvious or simple for the case of chaos. One of the 
best ways, perhaps, of defining chaos, is to say it is what we must have when 
we have many independently contributing elements. If we take any light source, 
which is divisible in principle, into many small parts, and assume that they all 
contribute independently, then it is inevitable from the central limit theorem 
of probability theory (there is a simple generalization in the quantum mech- 
anical case), that we have a Gaussian distribution of amplitudes. That is 
precisely what we mean by chaos. We shall see below what results from this 
chaos as far as measurements on the field are concerned. 
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8. CHAOTIC STATES 


All of the older and more traditional measurements in optics are ones that we 
can describe in terms of the first order correlation function G“. In a sense, the 
older optics by measuring intensities only, really looks no further into the 
field than into the contents of the function G“. For the case of a stationary 
field, as we shall see, the first order correlation function is determined ordi- 
narily just by the spectral data of the field, i.e. the intensities of excitation of 
the various modes of the field. When we say spectral data we mean polariza- 
tion data as well as the usual sort of spectral data. We can imagine different 
sorts of fields, which have identical spectra. We might take them from dif- 
ferent sources, such as a laser or a black-body; we might have intensity 
problems, but we can, presumably, filter black body radiation to achieve any 
spectral radiation distribution that we like, including the distribution which 
comes from a laser. Then we might ask the question, “‘Are these two fields 
distinguishable?” If their spectra are identical, the functions G will be 
identical, and then no matter which of the traditional optical experiments that 
we perform, we shall never be able to distinguish between these two light 
beams. 

On the other hand, the distinction becomes very clear when we begin to 
look at the effects that are involved in the higher order correlation functions. 
The first of these was the HBT effect. We have to measure the second order 
correlation function or some higher order correlation function in order to see 
the difference. For the particular case of chaotic fields, it must still be true 
that these higher order correlation functions are determined simply by the 
spectral data of the field. We know that for a multi-mode, chaotic state, the 
field (P-representation) can be described in terms of a multivariate Gaussian 
function 


P({ay}) = [I~ exp (—lenl?/<m,)). (138) 


K 1M» 


The only information that we need in order to construct this function is just 
the mean number of photons in each mode. Those are exactly the same data 
that are contained in the first order correlation function. So, for the case of a 
stationary Gaussian field, i.e. a chaotic state, the entire density operator con- 
tains no more information than is contained in just the first order correlation 
function. If we measure the first order correlation function for such fields, we 
measure everything. Now that is not to say that the higher order correlation 
functions have no interesting structure, in fact they have some structure, but it 
must be possible for the case of chaotic fields to construct the higher order 
correlation functions from a knowledge simply of the first order correlation 
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function. There is a theorem which permits us to do that. It is a generalization 
of a theorem which holds in the theory of stochastic processes quite generally. 
If we use this Gaussian distribution (138) to construct the correlation function 
of nth order, we find (remember it has altogether 2n different arguments, in 
general) that the higher order correlation function can always be expressed as 
the sum of products of the first order correlation functions and for the nth 
order correlation function we have to form an n-fold product; we have to sum 
n-fold products over all possible permutations of the coordinates, 


GC (x4 oe XV oa) = = U G™ (x; yp, (139) 
In other words, if the first coordinate in G" takes on all the values from 
1 — n, the coordinates associated in the same functions with those coordinates 
constitute the m! permutations of the set of numbers 1, ..., 7. 

As an example of such a function, consider the two-fold joint counting 
rates, the second order correlation function, in other words, with its arguments 
repeated; x, occurs twice, x, occurs twice. That is a sum of two-fold products 
of correlation functions. The identity permutation corresponds to the first 
term, and then the simple interchange of the x, and x, coordinates determines 
the second term. The two factors are complex conjugates of one another so we 
simply write the squared absolute value, for the second term: 


G(X XXX) = G(x, X,) GO (Xz x2) + |EO(X, x2)]?. (140) 


If the field were a coherent one, only the first term would be present, the 
correlation function would factorize and all we would have is 


GO, X2%2X1) = G(x, x1) G(x, X2). 


This means that there would be no correlation at all in the counting rates 
observed by two counters placed in the field; there would be no tendency 
towards photon coincidences. The additional term |G“(x,, x.) |? is due to 
the fact that the field to be averaged has a Gaussian distribution of field 
amplitudes. The term |G“(x,, x2) |? is the one responsible for the HBT effect. 

Let us now see how we calculate first order correlation functions. In most of 
the stationary fields we work with, we can always choose a suitable set of 
modes such that the mean value of the products of the creation operator for 
the mode k and the annihilation operator for the mode k’ vanishes unless the 
modes k and k’ are the same and then, of course, it must be just the mean 
number of quanta in the mode, 


Tr {pa,tay} = <ny> Sx - (141) 


Now in the stationary field, the expectation value when these mode indices are 
different must vanish if the modes are at different frequencies. If the modes 
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are of the same frequency, it is possible to have non-vanishing values for 
different modes, but then a suitable choice of the modes k and k’ (in other 
words the use of the correct principal axes in the Hilbert space of modes) will 
lead to a reduction of the expectation value to the form (141) in any case. Now 
if this is true, then we find the first order correlation function by just sub- 
stituting for the expectation values in the formula for the expansion of G™ in 
terms of the mode amplitudes, and we obtain 


G (rt, r’t') = 49 hea, (n> uy*(r) u(r’) exp [ia,(t—t’)]. (142) 


We see how for the stationary field simply the set of spectral data (the set of 
mean occupation numbers) determines the first order correlation function. 

Consider a specific example, say a beam of plane waves. As long as we have 
the symmetry which describes plane waves, i.e. a beam moving uniformly in 
the + x direction for which the mode function is u,(x) ~ exp (ikx), there need 
be only one argument in the first order correlation function. We shall call that 
argument Tt, 


| 
GY = G2), t= ty tp ———(%1 —X4). (143) 


The function G" is independent of y and z if x is the direction of propagation 
of the waves. Let us make a spectral assumption about the plane waves; a 
simple one to make is that the occupation numbers have a Gaussian behaviour, 
i.e. we have a Gaussian line profile such as may be given by Doppler broaden- 
ing. 


<n.» = constant - exp [—(@—@,)?/2y7]. (144) 


We could work with any line profile but since we have to take the Fourier 
integral of the spectral density, the choice of a Gaussian makes this easy. The 
integral is the summation over modes. We assume that the modes are closely 
spaced and therefore we can integrate over the propagation vectors or, equally 
well, frequencies (since we have specified the direction of propagation). We 
find the first order correlation function in terms of the interval variable t is 
given by 


G(r) = G0) exp (ia ot — 4717). (145) 


We see that G" oscillates with the central frequency of the line and that it 
decreases for values of t # 0. The envelope of G) has a Gaussian sort of 
behaviour (Fig. 10). 

In a photon coincidence experiment we measure G®). To calculate G2) we 
note that G(x,, x,) is the function for which t vanishes so we have G“(0) 
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G(0) as the first term of (140), 
G(x, x2, XX) = G(X, X1) GP (x,, xg) +1GM (x4, X2) |? 
= GO) GO)+ |G) |? exp (—y??)_——(146) 


The cross correlation function gives the variable contribution. The square of 
the cross correlation function does not have the oscillating part, it is simply 
the squared envelope, as it were, of G™’, so we get 


GO %2, X2X4) = {GM(0)}? {1+ exp (—y?t7)}. (147) 


By plotting G (coincidence rate) against t (Fig. 11), we see that G2) + 1 as 
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t— + 00 butis 2 att = 0. This shows that the HBT effect is not a small effect. 
In the original experiment it was small because the counters had long resolu- 
tion times. The HBT effect does not last over a long time interval. The time 
interval t is of order of 1/y where y is of the order of the frequency band width, 
so it helps to have a light beam with very narrow frequency spread to make 
the effect last for a somewhat longer time. The shape of the curve here is 
Gaussian because that is the transform of the spectrum we used. Had we used 
a natural line-shape, then we would have had an exponential behaviour of the 
HBT correlation on each side of t = 0 as shown in Fig. 11. Notice that we are 
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describing the effect without putting any mirrors in the calculation at all. We 
are just considering detection of photons in a one dimensional beam with two 
counters which we assume have infinitely small resolving time. There is always 
a tendency towards coincidences and this is, in effect, a statement that the 
photons tend to clump in the beam. We can still ask why this happens. The 
argument we have given is mathematically complete but it may be that there is 
not enough in what we have considered to convey the full physical image. In 
other words, the theorem which says that the higher order correlation func- 
tions can be expanded in terms of the lower ones is the essential point of the 
derivation. Everything we ever need to know about the character of Gaussian 
stochastic processes really is included in that statement but it is not, perhaps, 
completely clear what that statement is in physical terms. So let us use a 
slightly different method. We need not use this fancy expansion to evaluate 
the higher order correlation function, we can go back to the Gaussian distri- 
bution itself and imagine doing the integral. Let us see how that works. 
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Consider the correlation functions fort = 0. The HBT effect is there already; 
we do not see how the effect relaxes over time, but we do see, for example, that 
the coincidence rate is twice the square of the ordinary counting rate, eqn. 
(149). For time interval zero, G“(0) is just the second moment of P(«), 


G20) = | P(a) |6|2d2a = constant | exp (—|al2/<n)) |al2d2a. (148) 


Here for simplicity we have assumed that just one mode of the field is excited. 
Nothing in the argument really depends on the number of modes that is 
excited except the relaxation properties. The fact that there are many different 
modes excited which all have slightly different frequencies obviously deter- 
mines the way in which the HBT correlation disappears as a function of time, 
but here we are looking at the effect just at one moment of time, at time 
interval zero. In that case GY is just the second moment of the Gaussian 
distribution. : 
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The function G is just the fourth moment of the Gaussian distribution, 
G2 = [P@ \6|*d2a = 2( | P(a) isa)? = 2{G0)}?. (149) 


It is a property of Gaussian functions that the fourth moment is twice the 
square of the second moment. Does that have any physics in it? Well, it is a 
little hard to call it physics—it is so simple. The Gaussian function has a long 
tail, even though it decreases quickly (Fig. 12). The fact that the Gaussian has 
this tail means that the fourth moment is larger than the square of the second 
moment. If the Gaussian function were to cut off sharply, it would not be 
possible for the fourth moment to be so very much larger than the square of 
the second moment. So here we have it. It is because the Gaussian function 
has a tail that we find that these higher order correlation functions evaluated 
for time zero are generally larger by a factor of n! than the nth powers of the 
first order correlation functions. For a coherent field, on the other hand, with 
random phase, we have essentially a 6-function structure. If we had a precisely 
determined phase, for a coherent state, then, indeed, we would be considering 
a 6-function at one point in the complex plane. We essentially never have such 
information about laser beams; we never know, for example, when precisely 
they are turned on. So the actual representation of the function P for the most 
perfect of laser beams would be one in which there is a kind of circular 
symmetry in the complex «-plane and we have a probability distribution which 
is zero, except on a cylindrical shell, a shell obtained by rotating the diagram 
Fig. 13 about its vertical axis. That is what P would look like for the most 
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perfect of coherent states with random phase. Now we see that that kind of 
distribution has no variance at all in radial terms. For that kind of distribu- 
tion it is simply true that the fourth power of & when averaged is precisely 
equal to the average of & squared, and squared again; that is why there is no 
HBT effect for the purely coherent field. 
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Let us put it in still more elementary terms. In a chaotic field, the amplitude 
is fluctuating all the time. Sometimes the amplitude is large, more often, the 
amplitude is small. If we put two photon counters in the field, they will both 
of them tend to register when the amplitude happens to be large, and they will 
neither of them tend to register when the amplitude happens to be small. That 
automatically means that there will be a correlation in the behaviour of the 
two counters. The two counters do tend to register together and they tend to 
keep quiet together. No such correlation is present in the coherent field 
because the amplitude tends to remain constant as a function of time. Both 
counters are seeing the same amplitude at all times. Not just the same as one 
another, but an amplitude constant in time. For that reason, their behaviour 
is statistically independent of one another. Now that leaves us with a number 
of other questions, of course. It was surmised at an early stage that a laser 
beam was represented to quite good accuracy by one of these coherent fields. 
Why should this be so? It is a long story to analyse in detail. One reason is 
that, to a fair approximation, a laser is a macroscopic distribution of atomic 
polarizations equivalent to a macroscopic current distribution which does not 
recoil perceptibly when it emits photons. A laser is, to a good approximation, 
the kind of thing Kibble described by speaking of the radiation emitted by a 
specified current distribution (see Kibble, this volume). The situation is one in 
which the current is specified and the field is quantized, but that is not enough 
really to characterize the situation physically. 

We could still imagine random behaviour in this classical current which 
would lead to much variance in the amplitude radiated by the field and there is 
something intrinsic to the laser which tends to suppress that amplitude 
fluctuation. The laser is a non-linear device which has a kind of feed-back 
built into it. The field is radiated by a population inversion which has to be 
maintained in order for the device to keep amplifying its own field. If the field 
ever gets stronger than a certain steady value, it tends to deplete the inversion, 
the amplification tends to decrease, and the device tends to stabilize its own 
amplitude. The theoretical analyses of the laser do not really show that a 
precisely coherent state is radiated by the device, but they do tend to show 
that, at sufficiently high power levels, quite a fair approximation to such a 
coherent state is what is radiated. The complications of dealing with the laser 
problem itself are quite imposing and are described by Haken in this volume. 

We see that to explain the HBT effect quantum mechanically we go back, 
in essence, to its classical explanation. The important point is that we have 
found the justification for using that essentially classical explanation in the 
quantum context. 

Those are the best ways of explaining the HBT effect in elementary form. 
However, every time I present explanations of this sort, I find students come 
up later and ask, “‘What about the photons in all of this? What you have done 


QUANTUM THEORY OF COHERENCE 115 


really is to persuade us that it is not worth talking about photons at all. What 
you ought to do is go back to the same old classical discussions that were used 
long ago and simply give them their correct quantum mechanical interpreta- 
tion.” Now, all of us were taught somewhere in the early chapters of the 
quantum mechanics books that photons are little puffs of probability ampli- 
tude; that they have well defined wave-packets that move with the velocity of 
light and tell us whatever we are capable of knowing about the localization of 
the photon. In our discussions of photon counting experiments, of the HBT 
effect in particular, we do not see any of those wave-packets at all. My own 
feeling is that we are much happier for not having the wave-packets present. 
Conceptually, the wave-packets are certainly a very good way of looking at the 
localization of quantum mechanical particles but, in practice, we never know 
the various parameters that we would have to know in order to determine the 
wave-packets for individual particles. If a cyclotron produces a beam of 
charged particles, there is nobody in the cyclotron laboratory who can tell us 
what the wave-packets are of those emerging particles. What they can tell us is 
the degree of collimation of the beam, the energies of the particles in the beam 
and general data of that sort. The simplest description of that beam is a 
description analogous to the kind we have been using here for photons; it is 
really a density operator description which does not attempt to construct 
wave-packets for the individual particles because, to do it, we have to deal 
with information that we simply do not have. On the other hand, at least from 
a conceptual standpoint, it is an interesting thing to ponder the problem of 
just how we would go about constructing a description of a beam of particles 
if we did know everything about the wave-packets of the individual particles. 
That is what we shall do in the next section for the particular case of a chaotic 
field. 


9. WAVE PACKET STRUCTURE OF CHAOTIC FigLtp”° 


The formula for the chaotic density operator is just the Gaussian mixture of 
coherent states, 


a = 2 d7a, 
p = [ expL— Sou? m>y) oud ><Cae} ITT os (150) 
Evaluating the Gaussian integrals we obtain, 
p= Dee lo} Cor (151) 


(ma) «(1 +<m>)* 7 ™ 


The density operator is the sum of state vector dyadics corresponding to all 
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possible numbers of quanta in the individual modes. Because it is a stationary 
operator and has a separable or factorizable P-representation it is, in fact, 
diagonal in terms of the occupation number representation. The coefficients 
of the various quantum states are just powers of elementary parameters. The 
parameters are just the ratios {<n,)>/(1+<n,>)} which for the case of thermal 
radiation, when raised to the power m,, become 


<m> \m 
(5) = exp ( m,ho,/kT). (152) 

Suppose we are given a collection of as many identical photon packets as 
we want. How do we put those photon wave-packets together to make the 
field which is described by the chaotic density operator? When we discover 
how to do that, we shall learn something about the microscopic structure of 
chaotic fields. We shall find out, within chaotic fields, something about where 
the photons are relative to one another and the distribution of their number. 
A one-photon packet is described by 


> f(k) @;*| vac). (153) 


This is the basic element in the calculation. We are going to create a single 
photon but, we may create a photon in any mode so we sum over all modes, 
k. It takes a certain set of amplitudes, one amplitude f(k) for each mode in 
order to specify what the packet is. We can make the packet look like anything 
we please by appropriate choice of this set of numbers f(k). Imagine that the 
packet is localized somewhere in space. In fact, it might be localized in a differ- 
ent way. Nothing we are saying really will bear on the question of just how it is 
localized but imagine that it is a wave-packet that has some particular spatial 
localization. Let us even assume that that packet is somewhere near the origin 
r = 0, just for simplicity. This state has to obey a normalization condition, 
which we obtain by just squaring the state, thus 


<vac | LIK) Oy LI) a,*| vac) = > If) |? = 1. (154) 


The square is the sum of the squared moduli of the coefficients f(x). So those 
coefficients f must obey the normalization (154). Now let us imagine displac- 
ing this wave-packet from the origin to the point r. We do that by applying a 
displacement operator, which in quantum mechanics takes the particular 
exponential form shown: 


exp (—iP.r/h) ) f (kK) a,*| vac> 


= )\f(&) exp (—ik .r) a,*| vac) 
= At(r)| vac). (155) 
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We know what the momentum of any one of these photon states is. Let us 
assume now that it is a state with a particular propagation vector. Let us 
assume that the modes k, are plane-wave modes. So to apply the displacement 
operator means just including in our summation over modes, the particular 
phase factor associated with that propagation vector. The summation of 
creation operators which creates a photon packet in the neighbourhood of r 
can be written as 


At(r) = ¥ f(x) exp (—ik.r)a,". (156) 
k 


These are the basic elements in the “do it yourself kit”. We are given as many 
as we want of these packet creation operators referring to all possible points 
in space. The problem is to put these operators together so as to create a 
chaotic field. The method is not immediately obvious. If we just take a col- 
lection of the operators and apply them completely randomly (or in what we 
think is a completely random manner) to the vacuum state, we shall not, in 
fact, be creating chaos, we shall be creating order. 

The state with n packets identical except that they are shifted relative to one 
another in position so that they are at the positions r,, ..., r, is given by 


[I At(r,) | vac). (157) 
j=1 


Here we have applied the appropriate succession of creation operators to the 
vacuum state. It does not make any difference what order we apply those 
operators in, since the creation operators all commute with one another. In 
fact, that shows us that we are obeying the necessary symmetry rules; this 
state is symmetrical under the interchange of the coordinates of all the wave- 
packets, so we are describing bosons correctly. 

We define the normalization function for the state (157) as 


Ww ({r,}) = < vac| [] A(r,) [] At@) | vac>. (158) 
j I 


Let us describe an essential problem that we encounter in defining the state 
given by (157). The state is not normalized and the normalization of the state 
depends on just where the photons are relative to one another. If we are going 
to normalize the state, we have to calculate the norm of the vector we have 
written (its square length, in other words). The squared length is written out 
on the right of (158). It is the vacuum expectation value of a product of packet 
annihilation operators and packet creation operators. To evaluate the function 
W({r,}) it is a good idea to define another function which we can construct 
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in terms of a single creation operator for a packet at one point and a single 
annihilation operator for a packet at a different point, 


v(r—r’) = <vac |A(r) At(r’) | vac). (159) 


The operator product can be written equally as a commutator because if we 
consider the two operators in their commuting positions, we then have an 
annihilation operator on the right which applies to the vacuum state, and it is 
zero. Since the commutator is a c-number the function v reduces simply to the 
commutator itself, 


v(r—r’) = [A(®), AT(r’)]. (160) 
By using the expression (156) and its adjoint we then find 
v(r—r’) = LIFT exp [ik . (r—r’)]. (161) 


This is the explicit expansion of the function. If we know the packet amplitude 
J (k) then we know the function v. 

The function v, however, does not tell us anything about the phases of the 
packet amplitudes. It is easy to prove the following properties of function », 


v(0) = 1 


v(—r) = v*(r) 
jo(r)| < 1. (162) 


Firstly, the normalization condition; if r = r’, we have just the sum of the 
squared moduli of f(x), that is, unity. If we invert the argument of the function 
we get the complex conjugate, obviously. Furthermore, this function is 
bounded by unity, which is the value it takes on for zero argument, and if we 
calculate this function, we find, typically, for any smooth behaviour of the 
amplitudes f(x), the function goes to zero as the points r and r’ recede from 
one another. The packet, in other words, would have only a finite size. Any 
photon packet would have some characteristic packet dimension and v would 
vanish for |r—r’| larger than that particular dimension. 

The function W™ which is the normalization function, is easily seen to be 
the sum of products given by 

Wr }) = YT] vry—rp)). 
perm j=1 

We can use the same technique as we used to evaluate high order correlation 
functions for the Gaussian case to prove this theorem for the normalization 
functions. What that means then for the first few cases is worth illustrating. 
For a single photon packet we have 


W) = 1, (163) 
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If we have two packets in the field, W contains the product v(0) v(0) that is, 
just unity, and then we have the additional term with the arguments permuted 
which just adds |v(r, —r,) |7; thus 


w®) = 1+ |v(r, —r2) ie (164) 


If we have a state with three packets in it we have a somewhat more compli- 
cated structure, 


WO = 1+ |o(ry—12) |? + lot, —13) ? + lors) |? 
+2 Re v(r, —r,) v(F2—r3) v(r3—1r,). (165) 


These two cases W™ and W®?, illustrate the structure of the normalization 
function W™ in its simplest form. If the packets are far apart, for example, r,, 
is very different from r,, then the overlap amplitude v vanishes and if all of the 
packets are far apart in the n-fold case and tend not to overlap, then all of 
these normalization functions are going to reduce, essentially, to unity 
Ww™ ~ 1. The normalization functions, however, tend to differ from unity 
when the packets are made to overlap. When |r, —r,| is small enough so that 
the packets do overlap, for example, the function v rises to its maximum value 
of one, and, more generally, we find that the maximum value of W™ is n!. 
So that 


nl>w™so., (166) 


If we imagine distributing wave packets at random positions in the field, 
we shall find the normalization function is essentially always one, or nearly 
always one if we have a very dilute photon gas. But as the packets begin to 
overlap, the normalization moves away from unity and every time two packets 
precisely overlap there is a bump in the normalization function. We have to 
take that into account of course if we want to deal with a correctly normalized 
state. One of the hazards of dealing with wave-packets is that calculations 
based on them have not always been done with correctly normalized states. 

A correctly normalized state for wave-packets is given by 


1 

r; = t r; Cc). 
\In{rj}> Twp I] A (r,) | vac) (167) 
We have taken the state generated by the m packet creation operators and 
have put in the reciprocal of the square root of our normalization function. 

In our process of constructing the chaotic field we naturally want to put 
these photon packets in all possible different positions in the field, that is, we 
naturally think of a chaotic field having properties we assume to be uniform 
as a function of spatial variables. We wish to consider a mixture of states in 
which these photon packets are in all possible positions. A pure state with one 
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particular set of positions for the photon packets is given by (167). We mix 
these states together by multiplying by some kind of a weight function of the 
photon packet coordinates, then integrate over all photon packet coordi- 
nates. We know that the total number of photons in a chaotic field is quite 
indefinite because, in general, we do not know how many photons there are in 
any one mode so we have only a statistical notion of how many there are in 
the entire field. We therefore have to use a succession of these weight func- 
tions, one for every value of m and sum over all possible values of the number 
of photons in the field. The density operator we construct is given by 


pS] Fats) fe} dQnle ee (168) 


n y" 


We now wish to know the kind of weight functions we should choose in order 
to obtain a chaotic field. In other words, we wish to generate real chaos, to 
scrupulously avoid order, to avoid having the entropy of the field at anything 
less than its maximum value. The crucial point is the location of the photons 
to maximize entropy. At this point it should be noted that we can prove that 
the correct weight function for a chaotic field is 


FQ, ie r,,) = CW ry nee r,), (169) 


where c, is a constant. The proof is a bit long and is therefore omitted, but 
basically one way of proving it is as follows. We take the matrix elements of p 
between coherent states, and go back to the original chaotic Gaussian density 
operator (150) and take the matrix element of that between coherent states 
and then compare the two expressions obtained. We want them to assume the 
same form and the only way they can ever have the same form is by taking 
these weight functions F, to be proportional (the constant of proportionality 
is c, which is independent of packet positions) to the normalization function 
W™), Now that is simply a mathematical result and the only way of justifying 
it is by going back and making the explicit comparison and showing alge- 
braically that that is the only way we can do it. But we have there an interest- 
ing statement about the microscopic structure of a chaotic field because this 
function F,, is a weight function telling about the tendency of photon packets 
to overlap, (whether they really tend to overlap or not). The statement 
is here that they must have a distinct tendency to overlap. The function 
W™ is something which is unity when no photon packets overlap. However, 
every time a pair of photon packets overlaps it jumps away from the value 
unity to a larger value. In other words, this weighting tends to favour the 
overlap of wave packets. Many things are said about the HBT effect. A number 
of authors state that it is purely and simply a consequence of Bose-Einstein 
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statistics. We can see here that that statement is an extremely incomplete one, 
and if that is the only statement which is made, it is incorrect. It is incorrect 
because we can imagine photon wave-packets having any distribution of 
positions we please. Boson statistics are extremely permissive, we can imagine 
constructing states with any kind of anti-correlation we like between photon 
packets. How we would generate those states and what the practical device is 
that would do it, is another question. It might be very difficult to think of a 
device which would create anti-correlation between photons. 

The essential point is this. For Boson statistics and a requirement of maxi- 
mum entropy, then the only way of meeting the requirement is to give the 
photon packets a distinct tendency to overlap and use the weight function 
W™ in this construction which gives us an additional unit of weight every 
time two packets overlap. If we use this weight function in doing the calcula- 
tion, then the density operator obtained is 


Idr, 
v" ° 


p= TG | At(r,) ... At(r,) vac )< vac |A(r,) ... A(t) 


(170) 


The function W™ put in as a weight, cancels the reciprocal of W™ which is 
inserted in order to normalize, and so the density operator has a very much 
simpler structure. It has precisely the sort of structure that we would have 
guessed if we had guessed wrongly and forgotten all about the normalization 
problem to begin with. That is what has happened in some of the papers that 
have been written on the subject which purport to derive the HBT effect. It is 
one of the reasons why they say it results only from Boson statistics. What has 
happened is that, without any attempt at justification, they have used a 
structure like (170) which is not normalized (it has a variable normalization). 
In fact, the HBT effect has been put into the very first statement of the deriva- 
tion, and, naturally enough, it is taken out at the end. 

We still have to determine the coefficients c,, and it is not clear at this stage 
what the relationship is between our packet amplitudes /(k) and the mean 
occupation numbers n,. It is easy enough to derive these relationships. We 
simply carry out the spatial integrations on the form of the density operator 
given by (170) and we obtain the following expression in terms of the n 
quantum states, 


p= ¥ nlc [TLS P™ Hmm} ><} (171) 


n=0 mo=n k 


This is a double summation, a summation over the total number of quanta in 
the field and a summation over the set of occupation numbers of the individual 
modes. In the first summation the total number of quanta in the field has to 
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be restricted to the value 2 which is then summed over its full range from zero 
to infinity. We now compare this expression with the one that goes back to the 
chaotic density operator (150). In (171) we see powers of the squared modulus 
of the packet amplitude. In the original one we had powers of the parameter 
<n,>/(1 + <n,>) where <n,> is the mean occupation number. Obviously these 
two have to be proportional to one another. So the identification of the one 
density operator with the other requires that 


<m> (1+ <>) = ZIP? 72) 


where Z is the constant of proportionality. Then to complete the identification, 
the comparison of (171) with (150), we have to say that the unknown co- 
efficients c, which occur in the weight functions are given by 


Zz 1 
ar cere: 


= ST10-zis@00), (173) 


The parameter Z is then given by the normalization condition 


2_ east <n%> 
PUG 1m 2 Pe 
This means that the mean occupation number and the squared packet 
amplitude have slightly different shapes as functions of frequency or wave 
number as shown in Fig. 14. When we put the first photon packet in the field 
we may think that the spectrum of the field eventually is going to look like 
| f(A)|?, but it will not because of the effect of wave-packet overlap. There will 
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be the relationship we have written between | f(k)|? and the mean occupation 
number, which is a non-linear relationship. 

Finally, when we put all of this together, we have the following two expres- 
sions for the density operator. One in terms of the wave-packet amplitude 
squared and the n quantum states and the other in which we see explicitly the 
sum of the averages of the pure states, which correspond to the presence of 
packets at all possible positions, 


={Td= 
k 


‘ LF) Pl fer} >< Lore | 


p=T10-ZIf@P)E spe [T]A'G) | vac >< vac Tate) 1] (174) 


niV" 
Now can the latter expression be used for anything? 

Its principal use has been given above and lies in the statement that, in 
constructing a chaotic field from wave-packets, it is necessary to put the 
packets together with a weighting W™ which tends to favour the overlap of 
packets. We can use this form of the density operator for various sorts of 
calculations, for example, it is a relatively convenient form for finding the 
distribution of the number of photons in the field. But if we try doing photon 
counting calculations with this representation of the density operator, we find 
that the complications are immense. For example, to describe the absorption 
of a single photon from the field we have the problem that the photon may be 
absorbed from any of these packets and just to describe the single, most ele- 
mentary process of absorption of one photon, we would have to take into 
account interference terms in all of these packets because we generally do not 
know from which packet the photon was absorbed. So the use of photon 
packets does lead to immense algebraic complications in actual calculation 
and it is something to be avoided. On the other hand, we can carry through the 
comparison of the density operator formalism that we have used with the 
photon packet scheme and in that way see something about the microscopic 
structure of the field. 

One of the advantages of this approach is that the treatment of Fermions is 
really no different in principle from the treatment of Bosons. That is one of the 
real gains in using this particular approach, because we cannot use the coherent 
state approach at all in discussing Fermion beams. There is simply no such 
thing as a coherent state for Fermions. For any Fermion mode we can only 
have the occupation number zero or one, and there is no such thing as an 
eigenstate of the annihilation operator. An eigenstate cannot be constructed 
with those two possibilities. Try doing it algebraically and you will see what I 
mean. There are no coherent states for Fermions but it is easy to construct 
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maximum entropy states for Fermions. We can do that just by inspection. 
Since we are specifying the mean number of Fermions present, we are saying, 
in other words, that Tr(pata) for one mode is equal to a certain fixed expecta- 
tion value, 


Tr(pa'a)=<m), om = {9 (175) 


That means, as part of the maximization process for entropy, that the density 
operator has to be diagonal, so the density operator must be a linear combina- 
tion of the projection on the zero quantum state for one mode and the one 
quantum state. It must have the specified expectation value, and it must have 
trace unity. We have only two unknown coefficients and that determines the 
two coefficients, so in fact, the density operator has to be this: 


p = (1—<n)) |0><0|+<”>|1><1| where Trp = 1. (176) 
This may be generalized to fields containing many modes by the same argu- 
ments as for Bosons and we find the following relation between the occupation 
numbers for Fermions and the packet amplitudes, 


< Ziy@P=z= Ye (177) 


1—<n,> k 1—<n,>’ 
where the coefficient Z is determined again from the normalization condition. 
The density operator obtained, finally, has almost the same structure as that 
for Bosons, 


1 

°=Niarmer 

xy a i [I4'@) | vac >< vac | []4@) [] dr (178) 
n! j Jj Jj 


As far as the appearance of the summation over states in which we have used 
the packet amplitudes is concerned, that is precisely the same. The normaliza- 
tion factor simply looks a little different. The point is that the operators A(r) 
anti-commute, and so the function v is the anti-commutator of the two opera- 
tors, the one for creation and the other for annihilation of a packet, 


{A(r), ATF}, = o(r—r’). (179) 


The normalization function W™ is an antisymmetrical sum over permuta- 
tions of products of the functions v, 


Ww” = > (—1)? Mv(r;—rp)). (180) 
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That means that the packets will tend to avoid one another. This is a normali- 
zation which will go to zero whenever two packets overlap. In other words, in 
attempting to put together the chaotic density operator for Fermions with our 
“do it yourself kit” we lose every time we set down two particle packets over- 
lapping each other. We are only allowed to put states together with packets 
not overlapping and, indeed, there is an anti-HBT effect for Fermions, and 
that does come purely and simply from the Fermi statistics. That would be 
true whether we were considering a chaotic state for the Fermions or any 
other state. What we have shown is that a chaotic state for Fermions does 
have a particularly simple representation which is completely analogous to the 
one which we derived for Bosons. 
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Photon Statistics + 
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INTRODUCTION 


In this chapter I shall be discussing the statistical properties of optica 
radiation fields. I imply immediately that we are interested in describing light 
fields in terms of randomly fluctuating quantities, and that statistical cor- 
relations between these random quantities are experimentally measurable and 
amenable to theoretical interpretation. I should like first of all to spend some 
time explaining why, since optics is such a venerable subject in physics, 
such statistical considerations seem to be comparatively recent. It is, nowa- 
days, well realised that if we consider, for instance, one component of the 
electric field vector produced at a point in space by a classical (thermal) 
light source, this is a violently fluctuating quantity. Thinking in rather loose 
terms for a moment, we may consider the intensity of such a source at this 
point in space. This is a random variable whose most probable value is, in 
fact, zero. Thus, if I were to place my eye behind a sufficiently small aperture 
at this point in space, and look at the source, then most of the time I should 
see darkness. The reason why physicists have been able to avoid considering 
the true nature of such sources and to construct the whole of classical optics 
theory as though lamps were of constant brightness, lies in the time scale of 
these intensity fluctuations. The fluctuations are slower, the more mono- 
chromatic the source, but even for the most monochromatic classical sources 
available, the time over which these fluctuations take place is still very small 
—in the region of a few nano-seconds at most. The origin of the fluctuations 
is simply the random beating together of wave trains from various parts of 
the source, which are spontaneously emitted and uncorrelated in phase. At 
the present time, one can construct a source with the same statistical proper- 
ties as a classical lamp merely by scattering laser light from a medium whose 
dielectric constant is undergoing random but slow fluctuations, and one can 


+ The material presented here is a slightly expanded version of a review by the author 
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indeed see with the eye the properties which I have just described. A simple 
example is laser scattering from micron-size polystyrene balls suspended in 
water and undergoing Brownian motion. The time scale of the intensity 
fluctuations in this case is of the order of one-tenth of a second. 

In spite of the short time scale of the intensity fluctuations from thermal 
light sources, the first successful statistical experiments were performed using 
such sources, some years before the laser was invented. These were the 
experiments of Forrester, Gudmundsen and Johnson,“ and of Hanbury- 
Brown and Twiss.%) These were difficult and pioneering experiments at the 
time, but the development of the photomultiplier tube as a detector, and 
fast transistor-electronic circuitry for data processing, makes such experi- 
ments very much easier today. These early measurements of intensity fluctua- 
tions of optical light sources are so often discussed and quoted in relation to 
this subject that, rather than dealing with them in the appropriate place in 
the general scheme of this chapter, it is worthwhile in this introduction to say 
essentially what was measured, and to try to make quite clear exactly what 
the Hanbury-Brown-Twiss effect really is. First of all let me make plain one 
thing that the Hanbury-Brown-Twiss effect is not; it is not a tendency of pho- 
tons to bunch in time, solely because they are bosons. Much of the discussion 
following the Hanbury-Brown-—Twiss experiments would have been clearer 
had the laser existed at the time. A laser is a source of photons which show 
no bunching properties whatsoever. If a photodetector is placed in the field 
of an ideal laser, photodetections will take place completely at random, with 
no correlation between adjacent events. The existence of such a source shows 
that the intensity correlations observed in the experiments of Hanbury-Brown 
and Twiss and of Forrester et a/. are not directly attributable to the boson 
nature of photons. 

In fact a completely classical version of these experiments was developed 
during the Second World War in the field of radar meteorology. The statist- 
ical properties of the electric field of a scattered radar beam are the same as 
those of the electric field produced by a classical lamp. We can imagine that 
the scattering of a radar beam by a cloud of randomly positioned rain drops 
moving in the wind, might be similar to the scattering of a laser beam by a 
random set of Brownian particles, or more exactly to laser scattering from 
particles suspended in a flowing fluid. The common feature is a Gaussian 
distribution of (complex) field amplitudes. The extraction of spectral in- 
formation from the intensity fluctuations of the scattered radar beam pre- 
sents exactly the same problem as the determination of the properties of the 
particle motion in the laser-scattering experiment, or the explanation of the 
Hanbury-Brown-Twiss effect. The same formula—eqn (49) below—applies 
in all these cases. 

The explanation of the radar experiment is quite simple. Consider the 
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intensity received from a single raindrop illuminated by a constant intensity 
beam—it will also be constant. Now consider what happens if two raindrops 
moving at a fixed relative velocity are illuminated simultaneously. The 
received beams will be at different Doppler-shifted frequencies and the inten- 
sity received will fluctuate due to the beating together of the two return 
signals. The time scale of the intensity fluctuations gives information about 
the relative motion of the particles. If we now consider a whole set of particles 
in random relative motion, the same general ideas apply. Each particle pro- 
duces a Doppler-shifted return signal which beats with all the others. The 
spectrum of intensity fluctuations is then the fold of the spectrum of Doppler 
shifts with itself, and the normalized autocorrelation function of the (station- 
ary) intensity fluctuations g(t) = <I(t)1(0)>/<I(0)>? is just the Fourier 
transform of this, and has the general form shown in Fig. 1. 


2:0 


(2) (7) 


(@) To T—> 


Fic. 1. General form of the normalized autocorrelation function of intensity fluctuations 
of either radar returned from a random target, of scattered laser light from a fluctuating 
dielectric medium, or of light emitted by a thermal light source. 


The same form is obtained (with essentially the same explanation) for the 
intensity correlations of a chaotic light source, as measured by Forrester 
et al. and by Hanbury-Brown and Twiss. The source presents to the detector 
a randomly varying (Gaussian) electric field, with a “coherence time” (suit- 
ably defined with respect to the spectral profile) of the order of the inverse 
of the spread of frequencies present. It should be said, however, that this 
classical explanation of intensity correlations does not mean that the spectral 
profile of a source necessarily has a classical origin. The same type of inten- 
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sity fluctuations would arise from a source which exhibited only the “natural” 
line width associated with spontaneous emission, which requires a quantum 
mechanical derivation. Only a very fine distinction can be drawn between the 
“classical-like” intensity fluctuations of an inhomogeneously (say Doppler-) 
broadened source, and the “‘quantum-mechanical” intensity fluctuations 
arising from the natural line width of an atomic transition, in thermal 
equilibrium with its radiation field. 


oe 


(a) (b) 


Fic. 2. Illustrating the “bunching” of photodetections due to a fluctuating light intensity. 
i(t) is the detector output current. 

(a) constant intensity, random arrivals, 

(b) fluctuating intensity, bunched arrivals. 


The association between the correlated intensity fluctuations of a chaotic 
classical optical source and the “bunching” in time of events recorded by a 
photodetector placed in the field, is easily explained with reference to Fig. 2. 
The probability of a photodetection is proportional to the “intensity” of the 
field (a more comprehensive treatment of photodetection will be given later). 
For a constant-intensity field such as a perfectly coherent, single-mode laser, 
the photodetections occur at random in time, while for a fluctuating field the 
obvious departure from randomness can be called “‘bunching”. The Hanbury- 
Brown-Twiss bunching effect is thus a quite simple phenomenon. The effect 
of the usual beam splitter, by the way, is not at all fundamental. It merely 
serves to expose two detectors to the same light field at zero space and time 
difference. For measuring correlations between intensity fluctuations at 
different times at the same space point, one may just as well use a single 
detector. It gives identical results. 

The subject has developed greatly since the days of these early experiments, 
and statistical considerations now play a central role in many experiments 
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which use intensity fluctuations, for example, in spectroscopy and high resolu- 
tion interferometry. 

Theoretical consideration of the statistical properties of the general 
optical field has in recent years led to an improved framework for the theory 
of optical-field correlations or ‘“‘optical coherence”. The particular case of 
laser fields has, of course, received an enormous amount of attention, and 
my final introductory remark concerns the development of the theory of the 
laser. The laser appeared at an opportune time in theoretical physics, when 
one of the most interesting and active areas has been the development of 
techniques for dealing with irreversible processes. A notable date here was 
about fifteen years ago when van Hove published his work on the deriva- 
tion of quantum-mechanical master equations. Since these early papers, a 
great deal of progress has been made, and a vast literature now exists on the 
general subject of quantum mechanical systems interacting with thermal 
reservoirs, or quantum statistics. We can also include here the progress in 
finite-temperature many-body theory. When the quantum mechanical system 
is a radiation field interacting with non-relativistic states of bound electrons, 
the subject is variously called quantum optics, quantum electronics, statist- 
ical optics or photon statistics. It perhaps could be differentiated from the 
established discipline of quantum electrodynamics (even in its non-covariant 
low-energy form) by the emphasis placed on mixed (statistical) states of 
coupled electron~Maxwell fields and on the influence of dissipative processes, 
rather than on the pure states of the coupled fields. The progress in this 
theoretical area has thus been directly applicable to the laser problem, where 
the treatment of dissipation has been crucial to success. 

To review the subject, I shall first spend some time showing how one 
describes an optical field from the theoretical point of view, placing particular 
emphasis on the statistical description. Following this, I shall discuss how 
one describes such a field from the experimental or detection point of view. 
After these general considerations, detailed results will be given for particular 
types of field, namely, thermal fields, laser fields and mixed fields. In the final 
section I shall discuss the practical applications; these make up the relatively 
new subject of statistical spectroscopy. 


1. THE DESCRIPTION OF A LIGHT FIELD 


In some ways the theoretical techniques required for this subject are generali- 
sations of quantum electrodynamics, since in this field the statistical operator 
or mixed states are rarely mentioned. The generalisation, however, is not 
fundamental, and to a far greater extent the theoretical apparatus required is 
a very restricted subgroup of the known results for the coupled electron- 
Maxwell fields. 
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I shall not try here to cover exhaustively all the theoretical papers which 
have been written on the subject, but try instead to extract from them the 
essential details of theories which are directly applicable to experiment. This 
is a much easier task, but only so because of the efforts of many individual 
workers. I shall mention here particularly the work of Professor Glauber of 
Harvard University and his students, Professor Haken of Stuttgart and his 
students and co-workers, Professors Lamb and Scully of Yale University 
and M.1.T., and Dr. Lax of Bell Telephone Laboratories and his co-workers. 
It will be impossible to mention individually all contributors to the subject 
and I shall append general references covering the work of the above- 
mentioned authors.“ > ® 7) An excellent pre-1965 bibliography is given by 
Mandel and Wolf. 

It will perhaps help at this stage to look more closely at a typical experiment 
before embarking on the necessary theory to describe it. In any experiment in 
optics there exists a source which emits the radiation, the radiation itself 
which carries energy from the source through some medium which may absorb 
or scatter (including any optical instrument, for dispersion or diffraction), and 
finally one or more detectors which absorb the energy of the radiation field 
in some way which renders it visible. In order to discuss such optical experi- 
ments on an intelligent and fundamental level we, as always in physics, 
consider simplified situations where the (theoretically) inessential complica- 
tions of the real world are removed as far as is possible. For a detector then, 
for instance, we postulate an ideal device which renders visible, without 
noise or distortion, the absorption of energy from the optical field. There 
are two features of the real world which this ideal detector must incorporate, 
in order to be a useful approximation to a real detector. First of all, because 
of the relatively high energy of the optical photon, the output of the detector 
is not continuous. A real detector does not absorb power from the field 
continuously, but only in units of one quantum of energy. The second fact 
is that all the incident power cannot be detected with 100% efficiency. We 
therefore assume that our detector has a random chance y of detecting each 
quantum. Consider an experiment in which we set up a source, apparatus 
and a single detector. The output of the experiment, which we have to 
describe theoretically, is then a train of single events in time corresponding 
to the instants at which quanta of energy were absorbed from the field. We 
do not usually think of the output of an optical experiment in this way, for 
two reasons. Firstly, in many experiments so much energy is taken from the 
field that the energy of a single photon is small by comparison, and secondly 
the area presented to the field by the detector 1s in most cases so large, that 
it should really be considered to be a large number of independent detectors 
in parallel, each of which is absorbing a train of energy quanta. Basically, 
however, the raw data of optical experiments is in the form of a sequence of 
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events, and to discuss such experiments accurately in other terms, for 
example, in terms of a classical continuous detected light “intensity”, is 
possible only in the types of limiting situations mentioned above. From our 
typical experiment, therefore, we have, perhaps on an oscilloscope screen, a 
train of pulses in time. How are we to interpret them to compare with theory? 
Each individual pulse has clearly little significance and we thus need to look 
at collective properties such as, the mean rate over a certain interval of time 
(the closest one can get to a classical definition of intensity), any bunching 
properties, any periodicity, etc., in other words only the statistical properties 
of the sequence of events are useful. 

The fullest theoretical description, therefore, of optical experiments must 
be fundamentally statistical in nature, predicting such properties of distribu- 
tions of photodetections. Less complete theories have heretofore been 
sufficient, but for the reasons outlined in the introduction, interest is now 
focussed on these more fundamental questions. 


1.1 BASIS STATES AND INTERACTION WITH MATTER 


With this motivation, we proceed to an elementary theoretical introduc- 
tion. It will suffice, in exposing the basic results of the subject, to consider 
in detail the interaction between the free radiation field and a single bound 
electron only. The free radiation field without either sources or detectors 
is of trivial interest. The bound electron, for most purposes, may be con- 
sidered to have only two active energy levels. The effects of many such 
electrons will be simply additive. The radiation field itself may also, for the 
purposes of clarifying basic issues, be considered to consist of energy in a 
single mode, that is to say, we concern ourselves solely with the occupation 
of a single cell of the radiation phase space, and we may proceed a long way 
by taking the special case where this is resonant with the electronic frequency. 
We treat the interaction non-relativistically, in the electric-dipole approxi- 
mation. With these simplifications we may clearly neglect any consideration 
of longitudinal and scalar photons, and since no essential detail is lost we 
may also confine the theory to one state of polarization; we can take the 
polarization direction, for simplicity, in the direction of the electronic dipole. 
Thus we are left with the radiation field of one transverse photon. The inter- 
action between this field and the electron is sufficiently weak for us to use 
the stationary states of the uncoupled fields as a convenient basis set for the 
description of the combined system. The coupling will then give rise to 
continuous transitions between these states. At any instant, therefore, the 
occupation of these basis states describes the state of the interacting system. 

Let us denote the two possible states of the bound electron by the ket 
vectors |b>, where b can be b, or b, designating the upper and lower levels, 
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respectively. The radiation field will be specified by the infinite set of states |n)>, 
denoting the number of quanta of energy present in the mode volume. The 
direct product of these two spaces gives the basis set |r> = |n, b> for the 
combined system. The second quantized operators bt and b and at and a, 
will create and annihilate the electron and single-photon states respectively. 

In this representation, the complete quantum-statistical specification of the 
state of the system is given by the density operator 


P = )Prsl 7><S! (1) 


or, equivalently, by the set of density-matrix elements 
P,, = <r |P|s>. (2) 


If P,, factorises (P,, = P, P,*), then we have a pure state; otherwise, we have 
a statistically weighted mixture. The general problem requiring solution is, 
given P,, at time zero, find P,, at time ¢. Generalized master equations will 
provide the tools for this task. Statistically stationary states will be found in 
which P does not change with time, implying that the total transition rates 
into and out of any given basis state exactly balance. 

The interaction Hamiltonian is simply, in second quantized form, 


V = g[b,* b, a+b," b, at), (3) 
where the coupling constant g is given by 
1 ha \* 
u (= 2] ’ (4) 


ut is the electronic dipole moment and Q the mode volume; antiresonant 
terms have been neglected. To obtain the state of the radiation field alone, 
we reduce the total density operator P over the electron states, by simply 
summing over b, and b,, to obtain a reduced density operator p. Likewise 
a reduced density operator o may be found for the electron, by summing 
over the states of the radiation field. 

To include dissipation in general forms of the theory, the electron system 
is coupled to a heat bath, which may be at a positive temperature (damping) 
or at a negative temperature (pumping). Similarly the photon field is coupled 
to a reservoir which causes decay. However, photon statistics, sufficiently 
general for this presentation, will result if we confine the external influences 
to a mechanism for maintaining, on a time scale short compared with varia- 
tions in the radiation field, fixed probabilities o,,, and o,, for the occupation 
of the upper and lower electron states respectively, and a “window” in the 
volume Q, which lets out photons at a constant rate, thus damping the 
radiation field. The former mechanism is required to describe thermal 
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radiation statistics. The addition of the latter mechanism, as the temperature 
of the electron system is raised through infinity to negative values, changes 
the photon statistics to those characterising laser fields. 

I have mentioned that the above representation is a convenient one. It is 
not of course the only possible one and although I shall use it myself in this 
chapter, I must mention here that the most widely used alternative represen- 
tation uses an over-complete set of the “coherent” eigen-states of the annihi- 
lation operator of the radiation field, 


la> = exp (wat — ata)|O> 


= exp(—4al?) LaF In). (5) 


So that 
ala> = ala>. (6) 


As shown by Glauber (see his chapter in this volume), unique expansions 
in this representation are possible, although, as might be expected, manipula- 
tions must be carried out with some care; the treatment of quantum noise, 
for instance, although possible, and favoured by a number of eminent 
authors, seems much more difficult than in the number representation. This 
is only to be expected, in fact, since, as I will mention below, the representa- 
tion is more useful for the classical limit. 

The feature of this representation which is most important, is the con- 
venience with which operators in normal order may be evaluated. The 
representation also makes the classical limit very easy to obtain; this becomes 
apparent when van Hove- or Pauli-type master equations are transformed 
into the « representation, since they then take the form of Fokker—Planck 
equations for an apparently classical distribution function P(«). The function 
is not, of course, a classical one in general, and in some cases does not exist 
in any sensible class of functions. I have not time to go into these interesting 
questions of formalism but refer you to recent work of Glauber and Cahill, 
in which the whole subject of quasi probability distributions and the relation 
between the classical and quantum domains is very expertly treated. 


1.2 TIME DEVELOPMENT 


To return then to the general problem posed above, namely, given the 
initial state of the light field and the electron, how does the system evolve? 
The required time development can be found in many ways. Traditionally 
light absorption, for instance, is treated by degenerate secular perturbation 
theory and Fermi’s golden rule. Kelley and Kleiner“ use the Liouville- 
Neumann expansion of the time-development operator. Mollow“” uses 
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Hamiltonian equations of motion for the individual second-quantized field 
operators. The Langevin equations or Fokker—Planck equations of some 
laser theories could even be applied for this purpose. In this chapter I should 
like to bring to your notice the recent progress which has been made in 
perturbation theory, in connection with the density matrix itself. The von 
Neumann equation of motion of the density matrix might be called the very 
basis of quantum-statistical mechanics, and very early in the history of the 
subject, Pauli?) conjectured that equations connecting diagonal elements 
only and having a gain-loss structure typical of a Boltzmann equation should 
exist in certain circumstances. 

Thus one would initially know the occupation probability of a given level, 
and as time proceeded, this would be depleted by transitions into the other 
levels and augmented by similar reverse transitions, at known rates dependent 
on the perturbation. Such an equation is known as a master equation and, if it 
can be found, it clearly simplifies the whole structure of quantum-statistical 
perturbation theory, for any given problem. Indeed in many areas of physics, 
the Boltzmann equation is used continually for the most detailed practical 
calculations. The question of the possibility of constructing such an equation, 
by a fundamental derivation using the von Neumann equation, has been 
the subject of many papers in recent times. Apart from van Hove, whom I 
have already mentioned, notable work has also been published by Prigo- 
gine,“%) Swenson) and Zwanzig.’*) The present situation is very clear 
but perhaps not yet sufficiently generally appreciated, due to the very technical 
methods used for most of this work. I can summarize it by stating that 
equations of such a form can now be written down for all elements of the 
density matrix which, under the sole condition that the initial density matrix 
is diagonal, are as general as the von Neumann equation itself. They amount, 
in fact, to no more than a reformulation of the machinery of quantum 
mechanics. Thus for instance, for diagonal elements, we have the prescrip- 
tion for degenerate states (Pike“)} 


Prt) = Pr()+Y + [Ma (t—1V1P(t)-Polt de, 
where 


— jf)? 
Ma(®) = ~2(-i)V Vert PY (— Ig Ves Vn Hes Vag) +010.) 


The off-diagonal terms also have simple expansions of similar form, but now 


+ Equation (56) of Pike“® applies inadvertently to degenerate perturbations only; eqn (60), 
however, is completely general. 


PHOTON STATISTICS 137 


include energy-level differences. It is not difficult, using these formulae with 
a small density of states spread h(E), introduced in the usual way on to either 
the radiation mode or the electronic levels, to perform the integrals over 
energy and time, and obtain the result 


P,,(t) = Py, 0) + ahot ¥) VisVer [Pos (0) — Pre (0)] + O(V*). (9) 

This will hold for a time short compared with the transfer rates; hy denotes 
the value of h(E) at the resonant energy. 

I refer to my own paper quoted above,“ for details of these master 


equations, but can give here a simple method for deriving them. 
By definition 


H= LE, Ig><ql +2 Veg |P><Q| 
P,,(t) = Tr {plr><s)}- 
Using the equation of motion 
& (rel) = ELH, Ir><st, 
we obtain 


i =iTr p | Eur<al—E,lrd<sl+ DY Var (a> <sl-Y Veg ><a}. 
q qa 


Using the notation 
E,, = E,—E,, 
and absorbing diagonal elements of V,, into the E,, so that 


V,, = 0 for all r, 
we have 


OP,,(t’ : , , ; 
ee mi [Bs Palt)+ LVePalt— & VeaPrlt} 
t q#r q#s 


We multiply by the integrating factor exp (~iE,,t’) and integrate from 0 to ¢ 
which gives, for an initially diagonal state 


P,.(t) — P,,(0) 6,5 = if’ exp [—iE,, (t'—1)] ( Y Ver Past'\— ¥) VegP ra (')) di’. 
0 q#r qQ#s 


Iteration of this equation, with continuous withdrawal of the diagonal terms, 
gives the required result. Integration over a band of final-state energies, by 


138 E. R. PIKE 


the same physical arguments as in the derivation of the “Golden-rule” 
equation for transition rates, gives a linear t-dependence, and indeed the 
resulting master equation is just the extension of the Golden-rule formula 
for the elements of the density matrix, when starting from a mixed state 
instead of a pure state, See also a recent paper by Pike and Swain.®) 

We shall find that use of the master equation gives a unified approach to 
the problems of emission, absorption, generation of thermal light, and 
generation of laser light below, through, and above threshold, which I shall 
discuss in later sections. 


2. THE DETECTION OF A LIGHT FIELD 


2.1 EMISSION AND ABSORPTION 


One may consider the bound electron in the general theoretical situation 
described above, as acting as a detector of the field. To describe an absorp- 
tion detector, which is the more usual kind, we withdraw the electron from 
contact with the heat bath, populate the lower state only, and then watch 
the development of its reduced density matrix o as a function of time, under 
the action of the field. To be quite accurate, we should not allow the upper 
state to become appreciably populated, since emission would then take 
place back into the field and it would be partly an emission detector. We can 
therefore imagine the upper state to be greater than some ionisation energy, 
and that a mechanism is supplied which will withdraw the excited electron 
from the system. This process will constitute the ‘‘counting” of a photon. An 
emission detector can also be achieved both in theory and in practice, where 
now the upper state only is populated initially, and the effects of its depletion 
are measured. A He-Ne laser working at 0-6328 ym will conveniently detect 
infra-red radiation from a similar laser at 3-39 um in this way. It will even put 
it out completely if it is sufficiently intense, and we have found this a conve- 
nient way of performing laser transient experiments. External damping of the 
radiation field is not essential for studying these processes. 

The master eqn (9) of Section 1 very quickly gives the usual answers for 
the probabilities of emission and absorption. The matrix elements are: 


Vor = gn, b, | b,' b,a+b,' b, at | n, b>. 
The first term only gives non-zero contributions when 


n,=n,—1, b=5, 
b, = b, 
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The value of this contribution will be g./n,. The second term will give a 
contribution when 


nh, =n, +1, b, = 5, 
b, = by 


with value g./(n,+ 1). 


We first consider absorption. The simplest initial conditions at t = 0 are 
that the field is in state |n> and the electron in state |b,); we wish to find the 
occupation probability of |b,>, that is, the value of the diagonal element 
O4,, after a time At. Summing over the radiation-field states we have, using 
P,,(0) = 0, 


Oy,(At) = » Pe bu Mr Bu (At) =. th, At >; Vis Vor Py (0). 


Ny, S 
From the above calculation of the matrix elements V,,, this reduces to 


Ou,(At) = nhoAt >> Vib ech t pl? Pees by n-+1 b, (O)s 


The only non-zero contribution in the sum over n, occurs when 
n,+1 =n, 
since this is the only state populated at t = 0. We have then finally 
6,,( At) = ng? hon At. (10) 
Similarly for emission 
o,,(At) = mg7ho(n+1) At. (11) 


Figure 3 shows the contributions in diagrammatic form. All diagrams 
associated with master-equation perturbation theory are linked without 
further manipulation, and always begin and end with the same state. 


Emission Absorption 
ne n-1 “A 
“say TL = AL U 
n-1 u n ae 
7 uot 
wee L 7 Peat u 
n~ n-1 
Oult) = 1g7ho (n+1)t Out) = 1g? ont 


Fic. 3. Diagrams of emission and absorption. 
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2.2 SINGLE COUNTING STATISTICS 


Now an absorption detector placed in a beam of radiation consists of a very 
large number, N, of such absorbing electrons, in fact, so many that although 
the probability (eqn (10)) of each individual atom undergoing a transition is 
very small, nevertheless, the entire number of photons would be absorbed in a 
time of the order of 1/ng?h )nN. However, the number of detecting atoms 
that can be used in practice is limited, firstly because the electrons in the 
upper state must be collectable and, secondly, the electrons must not decay 
back to the lower state, within the thickness of the detector. In practice taking 
into account the excited, but lost, electrons and the photons which pass un- 
absorbed through the detector, the ratio, 4, of the number of successful 
“counts” to the number of incident photons is unfortunately no more than 
a few parts in a hundred (Foord et al.“”). The photon-counting distribution 
p(m, At), therefore, reproduces the photon-number distribution (assuming 
that the source, field and detector are in a steady state, and that photons 
absorbed in the detector are continuously replenished by the source) save 
for a random attenuation 7. . 
Thus using Bernoulli’s distribution (Scully) 


—{n ad 
pom, bt) = 5 ( ens" (=n Bt)" Paw (12) 
The most useful way to interpret theoretical calculations of p,,, by experiment 
is to construct the normalized factorial moments n“ of the photon-counting 
distribution. These are defined as 


! r 
n= = p(m, At) / | mpm, as)| (13) 
Using eqn (12) above 
m! n! 


nh = s. 


mG Pm | 


p mn! (yn At)” 


yy na An" Pan (14) 


Thus the normalized factorial moments of the photon-counting distribution 
are identical with the normalized factorial moments of the photon-number 
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distribution. This remarkable result allows us to treat the photon-counting 
distribution as though it were exactly the photon-number distribution, 
independently of detector efficiency and beam attenuation, so long as inter- 
pretation is always made via the normalized factorial moments. Sometimes 
the normalizing denominator is omitted, in which case the function is denoted 
by N®. (See also Pefina’s chapter in this volume for further discussion.) 

If the field has a coherent-state representation, the photon-counting 
distribution may be found by substituting for the diagonal element of the 
density matrix in the number representation (Glauber“!®)) 


Pan = [= lal?" exp (—|a2) 2a, (15) 


in eqn (12) above. 
The result, after a little manipulation, is 


(n ae ie 


p(m, At) = [Pe ) exp (~n Atla|2) d2a, (16) 


This formula was first derived by Mandel in the classical case. For clas 
sical fields |«|? (=I) is the field intensity. For long sampling times the 
quantity |a|? At is replaced by 


E(T) = I, <alat ala> dt. (17) 


The matrix element in this integral can be a function of time; this requires a 
little discussion. The ensemble distribution p,,, can be converted to a station- 
ary time distribution while still discussing the single-mode case, provided 
that the time scale of variations (coherence time) is long compared with 
the optical frequency. A spectrum analyser with a suitable Q factor would then 
show a very sharp line which varied in amplitude at different times. The 
mathematical description is the same as a Fourier analysis on an infinite 
time scale, which would give a description of the same field in terms of 
multiple modes: 


E*(r, 2) = > (sha,)* u,(r) exp (— i,t) a, and c.c. (18) 
k 
In this description 


E(T) = [ee t)E*(r, t)> dt (19) 
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The factorial moments can easily be expressed in terms of a by substituting 
eqn (16) in the definition, eqn (13). We obtain 


n© = <lal7>/<lal?>’. (20) 


The factorial moments of the photon-counting distribution are thus propor- 
tional to the actual moments of the “intensity-fluctuation distribution” 
P(1). The same holds for “integrated” photon-counting distributions using 
eqn (19) above. 

Discussions of n, the second factorial moment, or the closely related 


_ (2 1—<n) 
(An?) |ny? (=! ee ) 


played an important role in the early days of the subject. Its departure from 
unity, representing the value for shot noise, was then called excess noise and 
was the quantity usually measured. The first full photon-counting distribu- 
tions reported with modern fast counting equipment were by Johnson et 
al.2 at the Puerto Rico Quantum Electronics Conference in 1965. With 
such very accurate distributions, moments as high as n°) have been success- 
fully compared with theory, for thermal and laser sources (Jakeman et al.) 
The first publication presenting moments higher than the second was by 
Freed and Haus.” 25) 

Both the photon-counting distribution and its factorial moments may be 
derived from a single generating function Q(s) (Glauber). This is defined 
as the average over the photon-counting distribution 


Q(s) = } (1—s)™ pm, T). (21) 


A simple calculation shows that 


_cp@ 
p(m, T) = = a) 7 (22) 
nO =(-1y 208)| (23) 
ds” s=0 
and 
Q(s) = <exp (—nsE))>. (24) 


The generating function is thus the Laplace transform of the intensity- 
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fluctuation distribution. It is also given directly by the relation 
Q(s) = y (1 —ns)" Pun (25) 
h 


in the limit of very small T. 


2.3 MULTIPLE COUNTING STATISTICS 


If the transition probabilities of the detecting atoms are essentially indepen- 
dent of frequency over the range of frequencies used, the same type of 
arguments, as were outlined above, can be used to provide straightforward 
results for the multiple detection of photons in many modes, at a set of 
different space-time points x,. The simplest experiment of this type is to 
measure the expectation value, <m,(x,, Ax,)m(x2, Ax2>, for m, counts 
to be recorded over Ax, jointly with m, counts over Ax,. If we make these 
intervals the same and small but non-overlapping, we find that 


<m, m2) = (9 At)’ CE” (x1) E" (x2) E* (x2) E*(x,)>, (26) 
which we denote by 
<m, m> = (yn At)? G(x,, x2). (27) 
If the intervals overlap completely, on the other hand, we have 
<m,"> = (n At)? G?(x;, x1) + At G(x, x,). (28) 


These formulae will be discussed in the last section. 

The notation G(x, ... x2,) was introduced by Glauber”) for the rth- 
order correlation function of this type. The functions G and N™ are closely 
related (Glauber). In fact, if the coordinates x, ... x2, are all the same, 
then the two functions are identical. Simplifying the expression (26) above, 
when x, and x, have the same space point and when the fluctuations of the 
field are stationary, we get the autocorrelation function of photon-counting 
fluctuations at a single detector 


<m (t, At) m (0, At)> = (y At) G(a), t> At 
= (n At)? G20) + nAtGMO), 1t=0 


This quantity may be measured directly and is of great importance in 
intensity-fluctuation spectroscopy of thermal light sources, as I shall explain 
in the next section. Its Fourier transform is also often measured _and is called 
the intensity-fluctuation spectrum. 

To conclude my discussion of the detection of optical fields, I shall make a 
few remarks concerning experimental apparatus and techniques. An ideal 
detector, in the sense described above, is essentially available in the modern 


(29) 
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photomultiplier tube. Fortunately, an optical photon (~1 eV) is quite an 
energetic particle compared with room temperature (~j;eV), and in 
the last few years, due largely to the work of Eberhardt’?°) at I.T.T., optical 
detection has become virtually noiseless. Figure 4 shows the characteristics 
of such a tube. A large number of electrons (~107) arrive at the anode 
of the tube when one photoelectron is released at the cathode, with a spread 
in this number which is closely predictable from the theory of cascade mul- 
tiplication (Woodward), The transit-time spread in this particular tube 
is large (~ 15 ys) but faster versions are available. Light fluxes of the order 
of one photon/min can be detected (Oliver and Pike”), 

Most experimental work in photon statistics has been concerned with 
the measurement of G(z) for spectroscopic purposes. In most cases this 
has been done simply by analysing the current output of the detector, using 
a scanning electrical filter (wave-analyser), to obtain the intensity—fluctuation 
spectrum. Less frequently a similar experiment in the time domain, that is, 
a scanning delay, has been used to measure this autocorrelation. This is the 
method of delayed coincidences. Although popular and simple, neither 
method is very efficient, since a parallel bank of filters, or delays, if feasible, 
would clearly give an extremely large gain. This extra information is avail- 
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Fic. 4. Characteristics of the I.T.T. FW 130 photomultiplier tube. (After Foord et al.“7), 
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able if a full correlation can be performed, that is, if a signal is progressively 
delayed and multiplied point by point with the original in a more or less 
continuous fashion. The difficulty here is the multiplication, which is norm- 
ally too costly to perform. A great step forward, however, was made in the 
related microwave problem during the Second World War by van Vleck 
and his research assistant Middleton,?@® when they showed that the auto- 
correlation function of a “hard-limited” Gaussian signal was just 2/7 times 
the arc sine of the original. Hard-limiting means giving the signal a unit 
amplitude when above a fixed “clipping level’’, usually the mean, and zero 
when below. Multiplication then involves only zeros and ones and can be 
much more easily performed. 

In my own laboratory my colleagues and I have recently extended these 
methods to optical work. This has made intensity fluctuation measurements 
much easier and has greatly improved the accuracy of such measurements. 
The method which we have found most practical has been to call m(0), 
O or 1, (m,(0)), according to whether it is below or above an integer clipping 
level k, and to cross correlate this with the unclipped value m(z). The ana- 
logue of the van Vleck—Middleton formula is then (Jakeman and Pike®?%) 
for thermal light 


<m,(0) m(t)> Be 1+k (1)7-\42 
er ecm La i. Igr@l’, (30a) 


where g™ is a normalized form of G“(z), equal to G (1)/G (0). 
A simpler scheme is to clip at k = 0 and autocorrelate the clipped signal 
with itself. The result is? 


<mo(0)mo(t)> _[, , 1-m im? 
ae [1+ FS Por] |} - ae @r]. (30b) 


The above results are derived by making use of an extension of Glauber’s 
generating function Q(s) to two variables, by averaging over the joint dis- 
tribution p(m, m,). The function in this case is‘? 


Q(ss’) = {77ss'T1— |g)? ]+0(s+s')+ 1}. (30c) 


In the sections below on laser statistics and applications, I shall give some 
examples of the use of clipping. G®(0) i.e. N®, and higher factorial moments 
N® of photon-counting distributions have also been studied by a number of 
groups, including my own, and detailed considerations of statistical accuracy 
in periodic sampling of different types of light source have been made 
(Jakeman and Pike®”). The basic point to emerge, as might be expected, 
is that the statistical accuracy in the determination of intensity fluctuations 
by photon-counting methods is limited by the ratio of the total time of the 
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experiment to the “coherence time” of the light, and not necessarily by the 
ultimate cycling speed of the sampling system. The hardware used for 
obtaining photon-counting distributions at the present time ranges from 
very expensive sampling and decoding electronic systems cycling at tens of 
megahertz, to simple methods utilising ordinary nuclear pulse-height ana- 
lysers some three orders of magnitude slower. The original experiments of 
Freed and Haus‘??) were a further three orders of magnitude slower. They 
used a mechanical tape punch to provide a store. Another few orders of 
magnitude takes us back to Rutherford and Geiger’s study of the statistics 
of radio-active decay, performed in 1910.°) They depressed a key by hand 
on to a moving paper ribbon to record counts of photons viewed by eye from 
a fluorescent screen. I shall give examples of measurements of factorial 
moments in the next section. 


3. SPECIAL FIELDS 


3.1 PHOTON STATISTICS OF THERMAL FIELDS 


The simple theory of the interaction between a single bound electron and 
a single-mode radiation field which I have given above will again suffice, 
taken to first order only in the interaction, for the discussion of thermally 
generated light fields. In this case, the electron is the generator of photons 
instead of being the detector, as in the application in the previous section. 

The master equation reads as above (eqn (9)) but the populations a,,, and 
6, are now taken to be in thermal equilibrium at a positive temperature T, 
that is, 


C. 
= exp (—hw/kT). (31) 
Oy 
Thermal fields 
n " n ! 
a on 
/ = / 
n+ 1 { \ n-1 / u Tuy “heu/kT 
\ ' a. € 
N \ I 
= XN 
~ 
n u n | 


Fic. 5. Diagrams for thermal-light generation. 
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The two diagrams of Fig. 5 now give the only contributions and we have, 
tracing this time over the atomic coordinates, and summing over the number 
of atoms, N, 


Pnn (t) = Prn (0) + 2g7h Nt [(n+ 1) (Pn+ 1,nt+t Gn — Pan Fun) 
+7(Pn-1, n—1 Fun Pan Oy) ]. (32) 


A similar equation and its generalisation, which we shall introduce below 
for the laser problem, has formed the basis of much of the work of Lamb 
and Scully, who derived it by the Liouville-Neumann iteration for the 
density operator, but without performing the energy integrals to give the 
linear t-dependence. 

The equation may be written in the “adiabatic” approximation in which 
the atomic populations can be considered as unchanging. 


d nn 
= = —A(n+ 1) Pant C(n+ 1) Pn+ 1,nt+17— CAP yn + ANPy— 1, n~-1, (33) 
where 
A = 197i Nu, (34) 
and 
C = 1g7*hy Noy. (35) 


We look for solutions of the form 


Pan = y Prn exp (—A,,f). (36) 


For finite, non-zero solutions, 
Ao = 9 (37) 
and 
An = (C—A)n. (38) 
The solution is, when no photons are present at t=0, 


C—A (so oy. 


C—A exp [—(C—A)t] \C—A exp [—(C— Ayr] 7 


Pan(t) = 


The time dependence shows how the distribution decays to the stationary 
state. A,(=1g*hyN(o,—6,,) is essentially the Wigner—Weisskopf inverse 
linewidth. 
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In terms of the mean value of the occupation number, nv, the time-inde- 
pendent solution is the familiar geometric distribution 


zh 


n 


= ——___., 40 
Pun (nA = 1)"* 1 ( ) 
where we have calculated that 
n= — (41) 
= I = ° 
0 hey Pan exp (ha/kT)—1 


This, of course, is known as the Bose factor. The same distribution seems to 
result in all orders of perturbation theory. The generating function for this 
distribution is 


1 
= ——_—_.. 42 
Xs) 1+nAt iis 2) 
The factorial moments are 
N® = (yAtay r!. (43) 
The photon-counting distribution is 
(nAt n)" 
AD = 44 
p(m, ) (nAtn+1)"*? ( ) 
The field has P-representation with the intensity—fluctuation distribution 
1 
P(\a|”) = ———exp (—|a]|?/<Ja|?>). 45 


I should not need to emphasise that the fact that a field has a positive and 
well-behaved P-representation, such as the Gaussian one above, does not 
imply that it is a classical field (Glauber“®), This is clearly shown by the 
thermal field above, where the factor unity representing spontaneous emission 
(quantum noise) was essential for the calculation. The point has been the 
subject of some controversy (Mandel and Wolf), 

Examples of light fields having the same statistical properties as thermal 
fields can, in fact, be generated by imposing a Gaussian amplitude modulation 
on a coherent laser beam by passing it through a rotating ground-glass 
screen. These are so-called ‘“‘quasi-thermal” sources, other examples of 
which I shall discuss more fully in the later section on scattered laser light. 
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Intensity fluctuations, showing the Bose character, were obtained a number 
of years ago in this way.C%3” 

Figure 6 shows an early result from R.R.E. which appears in Professor 
Kastler’s revision of Bruhat’s text ‘“Thermodynamique”. Here quasi- 
thermal and laser photon-counting distributions are directly compared on 
an oscillogram. 


Fic. 6. Oscillogram of photon-counting distributions of quasi-thermal and laser light. 
(Reproduced from Bruhat “Thermodynamique,” Masson, Paris, 1968) 


A “single-mode” thermal field can vary in time as discussed above; the 
amplitude-modulation side bands which result, form an optical spectrum. 
On an intermediate time scale of Fourier analysis many adjacent modes, 
each varying in amplitude, can be considered. The field will also vary in 
space. The correlation functions G“ introduced by Glauber can be used to 
characterise the statistical properties of a general superposition of thermal 
modes. The N“ derived above are the values of G® for zero coordinate 
differences. 

The optical spectrum, for instance, of a stationary field is defined as the 
Fourier transform of the autocorrelation function mentioned above, 


G™ (r, 1) = <E (r,t) E*(r, 9). (46) 
The intensity-fluctuation spectrum is the Fourier transform of 


G(r, t) = CE“ (r, t) E-(r, 0) E*(r, 0) Et (r, T)>. (47) 
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On the other hand, spatial correlations may be investigated at a fixed time; 
for instance, by measuring the function 


G(r, 1,0) = <E7(r,, 0) E™ (12, 0) E* (rz, 0) E*(r,, 0)>. (48) 


The full range of arbitrary rth-order correlations are, in principle, open to 
measurement. The values of and relations between such correlation func- 
tions are the basic problems of photon statistics. For thermal (Gaussian) 
fields it can be shown‘?*+® that 


G(r, 1) = [6 (7, OP +1E™ (x, 2)? (49) 


and further that?® 


GO(x,...x9 = ¥ [] CP xp xpr4-p)- (50) 


Perms j=1 


The rth-order correlation function for Gaussian fields is a symmetrical sum 
of products of first-order correlation functions. 

The interpretation of the G® as multiple photon-detection probabilities 
shows that such factorization properties imply some degree of independence 
of photon-detections at different space-time points. If all such events were 
independent, the field would be completely coherent (lack of knowledge of 
the photon number implies gain of information concerning the conjugate 
phase variable). The highest degree of factorization possible for a thermal 
field isS® 


G? (0... X53.) = 7! a & (x;) ile (x,). (51) 


Such a field has been called ideally incoherent, first-order coherent, or 
weakly coherent.0” 

This factorization has been confirmed experimentally up to G® for a 
chaotic “quasi-thermal’’ field produced by scattering a laser beam from 
polystyrene balls undergoing Brownian motion in water.*!) These results 
are shown, together with the results for an ideal laser, in Table I. The 
factorization of the joint photon-counting probability G® given by eqn (49) 
permits a symmetrical optical spectrum to be found by correlation of photon- 
counting fluctuations, using eqn (29) above. This is the basic theory of the 
experiments of Forrester et al.,“? and Hanbury-Brown and Twiss.” We have, 
at a single point 


<m (zt, At) m(O, At)> = (nAt)’ {[G™ (0)]? +1G™ (I. (52) 
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TABLE I 


Normalized factorial moments for coherent source? 


(After Jakeman ef al.?») 


Normalized factorial 


moment Experiment 
n@) 1-0000 
n®) 1-0002 
n@ 1-0001 
n 1-0006 
n© 1-003 
n® 0-994 


10’ samples 


Theory 


1-0000 

1-0000 +0-0009 
1-0000-£0-0017 

1-0000 +0-0030 
1-000 +0-012 
1-000 +0036 


Normalized factorial moments for incoherent source 


Normalized 10° samples 
factorial 
moment Experiment Theory 

nO 1-000 +0-036 1-000 +0.038 
n®) 2:00+0°15 2°00+0°16 
n® 6:05+0°85 6:00+0°85 
nh 24°745°9 24.0+5:°7 
n©) 130+45 120+49 
n© 805 +461 720+510 


9 X 10° samples 


Experiment Theory 
1-000 1-000 
2:004 2:000+0°017 
6°05 6:00-0:09 

24°7 24:0+0°6 
130 120+5 
805 720+54 


a Similar results were presented at the Conference CI07 in Paris, May 1966, but 
have not previously been published. 


The requirements on the extensions in space and time over which the above 
formula is valid are known as coherence conditions, and are simply derived 
in any experimental situation with thermal light, by geometrical and spectral 
parameters. The factorization is, of course, used conversely to determine 
these parameters, by varying the spatial size and sampling time of the 


detector. 


The effect of changing the interval At to a finite value T; comparable with 
the coherence time of a Gaussian source with Lorentzian spectrum of half- 
width I (= 1/T.), is given by Jakeman and Pike, (unpublished) 


g2kT, T) = 1+ 


(cosh y— 1) 
2? 


g(0, T) = n® = 1+y—4y?+ 


2, exp (—2y) 


exp(—2ky), fork > 0 (53) 


Fe > for k =O, (54) 


where y = T/T,, and the small g™ is a normalized version of G”, formed by 
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dividing by [G"]". These results are shown in Fig. 7. The time scale on the 
figure is for scattering of laser light from a protein molecule, but the form of 
the curves holds for any chaotic source with Lorentzian spectrum. The 
normalized second factorial moment n‘”? tends to two as the sampling time 
is reduced to zero, and to one for infinite sampling time. 


Sampled Gaussian Lorentzian fight 


Photon statistics 


1000 


Fic. 7. The normalized second-order correlation function, g@(z), (t=T), for integrated 
thermal light. 


The effects of finite sampling time on the photon-counting distribution, 
which one can see in n) in Fig. 7, have been discussed in a semiclassical 
formalism in the limit of large T by Mandel,@® and more recently by 
Glauber,” Bédard and very fully by Jakeman and Pike.¢ 

The fluctuations are averaged out as the sampling time T reaches and 
exceeds the coherence time T.. This effect may be used to measure the width 
of a spectrum of known shape.¢”? 


3.2 PHOTON STATISTICS OF LASER FIFLDS 


There have been a very large number of publications on this subject. 
The approaches used may be classified into three groups: 


(1) use of a Langevin-type equation; 
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(2) use of a quantum mechanical Fokker—Planck equation; 
(3) use of the equation of motion for the density matrix (master equation). 


Clearly I have not time to go into detail in reviewing all these papers. 
(A review by Lax” of his own numerous papers ran to three hundred pages 
in a recent Brandeis University summer school!) I shall try to pick out, 
however, the essence of all this work as it applies to actual statistical experi- 
ments on laser light. Well below its threshold, the laser is the same as a 
thermal source, and its photon-number distribution is the geometric distri- 
bution; n@ takes the value two, due to the typical fluctuations discussed 
above of a field in equilibrium with a thermal reservoir. This would hold 
independently of temperature for all positive temperatures. Well above 
threshold, however, a laser produces Poisson photon-counting statistics; 
n” takes the value unity and the fluctuations are damped out by the satura- 
tion mechanism of the oscillation. Between these two limits, the statistical 
properties of the steady state, in particular, the behaviour of n® and py, 
have been given by Risken,‘*?) by Lax and Louiseil* and may also be 
found in Scully, Lamb and Stephen.“ The statistical properties of the 
non-steady state have also been widely studied by the same workers and 
others, 

The effects can again be predicted using the simplified theoretical model I 
have already used twice above. This time the master equation has to be 
written to higher order and the effects of the loss of photons from each 
state |m> at a rate Cn must be taken into account. In the simplest case the 
(pumping) reservoir is assumed to depopulate completely the electron 
lower state. Thus at ¢ = 0, o,, = 0. The theory is not much more difficult 
if a finite occupation of the lower state is allowed but this does not add any 
significant extra physics. 

The master equation for the time development, with N atoms in the 
field, is 


Py (1) = P,,(0)+ tho Nt y. {Vs Ke 


+4(tho)* dF (—4V,,, Kees V 


S28 
SiSz 


Ver + 6Ves, Vais Vosg V, 


SS2 Sor 


—4V,, Vis, Vea Vi) [ P,,(0) — P,,(0) ] oe a4 f 
—Ctn, Pnen, + Ct (n,+ 1) Pnyp+i, npt+1° (55) 


The last term, representing a loss from the n, + 1 level, is equivalent to a 
gain in the ,th level. The same two terms could be arrived at by postulating 
absorbing atoms, but the lack of non-linear terms is then embarrassing. 
The diagrams which give contributions to the fourth-order terms are shown 
in Fig. 8. These are easily evaluated, and although these alone would give a 
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satisfactory model for lasing action over a wide range of operation, a slightly 
more satisfactory result is obtained when the complete series of higher-order 
terms is summed. The diagrams required are simple generalizations of those 
in Fig. 8, and the final result when traced over the electronic levels is the 


equation 


Laser fields 
n u n u n MT) 
S nt+1 ears 
vo oO ae L 
/ / 
n+t | L n4+ty 1 / Typ=0 
\ \ / 
/ ” 
Ban Nes nf / 
n u i 
\ 
n u -- \ N u 
V4 , ae 
\ 
{ \ 
ale L x 
\. S L 
~ 
n4+1 ss. 
n 
i) n j 


Fic. 8. Diagrams for fourth-order atom-field interaction. 


dp By"! 
Gt = —AQnt [L4G DT Pm 


By)-! 
+An| Ln] Pn—1> n-1 — CAPyy, + C(n+ 1) Pn+i,nt1> (56) 


where 
B = (ngho)*A. (57) 


The equation is due to Scully and Lamb.“ 
We again look for solutions of the form 


Pun = >, Pan EXP (Amt). (58) 
The stationary-state solution corresponds to 2) = 0. It is“>? 
" A B car! 
= Z!} —|{1+—k 59 
Ps i a a | (59) 
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Z being the normalization constant. Further evaluation gives 


(A?/BC)" (A/B)! 


ee a a EE, EE 60 
oe [n+ (A/B)]! (17; (1; (A/B)+1, A?/BC)] Sa 

In the a-representation an exactly equivalent form is given by“?® 
P(|a|*) = Z~? exp (—|a|?) (<[oe]?> — Joel?) !41?2-9 (61) 


and the first term of an expansion of this, about the mean value, gives the 
expression 


P(\a|7) = Z~1 exp (- er +a 5 ) (62) 


found by Risken,'*?) Fleck,“*7> Hempstead and Lax‘¢®) and Lax and 
Louisell.*») This distribution has the desired properties of approaching the 
geometric distribution as A/C — 0 and the Poisson distribution as A/C > oo. 


He-Ne laser 
photon statistics 
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Fic. 9. The normalized second-order factorial moment, n® (= g@ (0)), of the photon- 
number distribution of a single-mode gas laser as a function of output power, showing the 
effect of resolving time corrections and finite sampling time. 
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Its lower normalized factorial moments have been evaluated to compare 
with experimental photon-counting results, and good agreement is found. 
The first experiments of this nature were by Freed and Haus?) and by 
Armstrong and Smith,“® where n was compared with theory. The develop- 
ment of highly stable single-mode lasers has made such measurements now 
very easy, and much more accurate comparisons have been made by Pro- 
fessor Arecchi’s group®® in Milan, Professor Korenman®! and his col- 
leagues at the University of Maryland and my own group at R.R.E.©?” 
Instrumental effects become important at the few percent accuracy level 
and corrections must be made for the finite ratio of detector resolving time 
to sampling time.‘°?” This is shown in Fig. 9 where I have shown both the 
raw and corrected data for n® using a Spectra-Physics model 119 laser. 
For sampling time T as large as 2 xs a theory of integrated laser fluctuations 
is required. This problem has been treated by Lax and Zwanziger.”® 
Experiments with several detectors have been performed by Davidson and 
Mandel.» 

The higher eigenvalues /,, and the corresponding p™, have been calculated 
by Risken and Vollmer©” and by Lax and Louisell,**) and thus the value of 
G(r, t) can be compared with theory. In Fig. 10 is shown a three-dimensional 
picture of this function as the laser varies from well below. to well above 


He — Ne LASER 
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Fic. 10. The normalized second-order correlation function, g(t) for a single-mode gas 
laser. 
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threshold. The curves were constructed from experimental measurements ob- 
tained at R.R.E., by the method of autocorrelation of clipped photon-counting 
fluctuations mentioned above.t In Fig. 11 are shown the corresponding 
intensity-fluctuation spectra using an ordinary wave analyser. 

The calculations of Risken and Vollmer‘** predict that Gz) should 
consist of a dominant single exponential below threshold, but that appre- 
ciable fractions of several faster exponential decays contribute in the region 
immediately above threshold. McCumber®* and Mezrich,‘°® however, 
predict essentially a single exponential decay in the threshold region. All 
theories give the ‘‘line-narrowing” at threshold, characteristic of second- 
order phase transitions. The role of order-parameter is played by 2—n'?, 


INTENSITY FLUCTUATION SPECTRA 
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Fic. 11. The intensity-fluctuation spectrum of a single-mode gas laser near threshold. 
The traces shown correspond to the coordinates at their centres, 


+ The method is valid for arbitrary statistics at low count rates. 
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which is unity in the ordered (lasing) state and zero in the disordered (thermal) 
state. It has been realized for some time in the study of phase transitions and 
in transport theory, that the simple assumption for a correlation function, 
made almost universally, viz. 


<a(t) a*(0)> = <|x(0)|?> exp (—Ar) (63) 
may be unrealistic, since many different decay rates may contribute. Here 
in the laser, we have a very attractive system for detailed investigation. 
Arecchi et al.°’) have made spectral measurements which support the 
analysis of Risken and Vollmer. In my own group, in addition to spectral 
analysis, we have employed the new methods explained above for obtain- 
ing the correlation function directly. In Fig. 12 is shown an autocorrelation 


1-0 Autocorrelation of laser near threshold 


dite 108 samples 
I = : 
ir 2:7 
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= Single exponential 
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Fic. 12. The autocorrelation function g‘*(rt) of the photon-counting fluctuations of aa 
single-mode gas laser at 2:7 times threshold output power, measured by the method of 
single-clipped autocorrelation (SCAC). 


at 2-7 times threshold power which is in the critical region to test the theory. 
It is quite an accurate measurement, some 10® delays having been sampled. 
Preliminary analysis of such results shows that a valid confirmation of 
Risken and Vollmer’s results will be difficult, since the departure from a 
single exponential decay is extremely small. With regard to experimental 


PHOTON STATISTICS 159 


technique, it is necessary to perform photon-counting experiments simul- 
taneously with a spectral analysis of intensity fluctuations, to fix the threshold 
position absolutely. This is not necessary when investigating the auto- 
correlation, since G‘)(0) immediately gives the position with respect to 
threshold. 

The experiments I wish to mention finally in this section on laser statistics 
are laser transients. The equation of motion for p,, has been integrated 
numerically for transient conditions by Scully,“ and the results show how 
the laser oscillations build up from noise below threshold. Arecchi et al.” 
have performed such experiments by Q-switching a laser with a Kerr cell 
inside the cavity, and have successfully compared the results with the theory. 
The results are shown in Fig. 13. 
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Fic. 13. Photon-counting distributions of a single-mode gas laser in transient operating 
conditions (After Arecchi et al.). 


3.3 PHOTON STATISTICS OF MIXED FIELDS 


Having discussed the case of thermal fields and laser fields, we come now 
to the consideration of superpositions of the two types. We take the laser 
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to be operating well above its threshold, so that its counting statistics are 
Poisson. It then acts as a “local oscillator” against which the thermal field 
can be heterodyned. Many workers have studied the statistics of such fields 
and in Table II is given a bibliography showing the results obtained to date. 


TABLE II 


Photon counting statistics of Gaussian—Lorentzian and coherent 
fields—a bibliography (After Jakeman and Pike.(*®)) 


Field Limit O(s) P(E) p(n,T) N® N&) 


present present 
paper paper 
present present 
paper paper 


T>0 present present present 
; paper paper paper 


IT +oo present present present present present 
@g= po paper paper paper paper paper 


| 
Tr>0 G66 R45 —s_- G66 L65 G454 
Wq4 = Wo P67 L65 G66 
| Se L654 
on <E>=0 H64 J68a°~—s« BG 64 B66° B66° 
J68a° J68a° 
<E.>=0 G65 R45¢ M59 R454 R45¢ 
= M64 M58 G66 
: L654 
sae et TENE -.. E>=0 G65" J68a M59" Pus6¢ G65 
T>o H64 MP65 MS58¢ 
J68b! ~—s- G65 


(E>=0 G65 R45 —- G63 R452 R454 
M64 = M64 G65 G65 


---- Incoherent field £;. 
Coherent field E,. 


Mathematically equivalent calculation. 

Not formal expansions in 1/y. 

In these early papers bunching was thought to be a property of photons. 
Recurrence relations only. 


Roca 
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e Calculated numerically. 

f To second order in 1/y. 

G_ Glauber, R. J., Phys. Rev, Lett. 10, 84 (1963); Jn “Quantum Optics and Electronics” 
(eds. C. de Witt ef al.), p. 65. Gordon and Breach, New York (1965); Jn ‘‘Physics of 
Quantum Electronics” (eds. P. L. Kelly et al.) p. 788. McGraw-Hill, New York (1966). 

R_ Rice, S. O. Bell. Syst. tech. J. 24, 46 (1945). 

L_Lachs, G. Phys. Rev. 138, B1012 (1965). 

H Helstrom, C., Proc. Phys. Soc. 83, 777 (1964). 

J  Jakeman, E. and Pike, E. R., (a) J. Phys. A. 1, 128 (1968); (b) J. Phys. A. 1, 627 (1968). 

B Bédard, G., Phys. Rev., 151, 1038 (1966). 

M Mandel, L., Proc. Phys. Soc., 72, 1037 (1958); Proc. Phys. Soc. 74, 233 (1959); Proceed- 


ings of the 3rd Conference on Quantum Electronics, Columbia University Press, 
New York (1964). 

Pu Purcell, E. M., Nature, Lond., 178, 1449 (1956). 

MP McLean, T. P. and Pike, E. R., Phys. Lett. 15, 318 (1965). 

P Pefina, J., Phys. Lett. 24A, 333 (1967). 


The table also includes the limiting single-field cases for reference. This 
table is taken from a recent paper on the statistics of heterodyne detection 
(Jakeman and Pike®®), For the general case where the laser frequency, 
@,, differs from the centre of the thermal-field spectrum, wo, and where this 
spectrum is Lorentzian and of arbitrary half-width [T (= ¥y/T), only the 
generating function and the second factorial moment of the counting dis- 
tribution have been obtained. The latter has the value 


1 exp (—2 
24, SSBL—2) 


1 
N® = {n,>? (1+ Sais 
y  2y? 2y? 


)+2.<m><n.) 
1 2 exp(—y){(y? —Q7) cos Q—2yQ sin Q} 
a 


2) 2(Q? —y’) 
aap ary | +" ie 


where 7; is the incoherent and n, the coherent signal and 
Q = (@,—@o)T. (65) 


The behaviour as a function of y is shown in Fig. 14 for the particular para- 
meters given. Of more practical interest is the autocorrelation function or 
the intensity-fluctuation spectrum of a mixed field. It can be shown that 
under reasonable conditions the complete optical spectrum of a source with 
arbitrary statistical properties can be measured using a heterodyne reference 
beam. This is well known as a radar Doppler technique and the principles 
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are identical. It has thus a great advantage over ordinary intensity-fluctuation 
spectroscopy (which is also a well known radar technique) in that the 
spectrum can be asymmetrical, or of arbitrary shape, and is immediately 
available from the experimental Doppler spectrum. The ordinary intensity- 
fluctuation spectrum has other advantages, however, which will be discussed 
in the last section in relation to the measurement of the structure of a 
turbulent fluid flow. 
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Fic. 14. The excess normalized second factorial moment, n®—1, (=N®/<{n>?—1), in 
the heterodyne case, for the values of R (=| @o—qa|/I) shown and for¢n;)/(<n;> + (ne) = 


0-1. The broken lines gives the value for the case where the coherent signal is absent. (After 
Jakeman and Pike.©®) 


4. PHOTON STATISTICS OF SCATTERED LASER LIGHT 


4.1 SCATTERING BY A RANDOM MEDIUM 


We consider in this section scattering of a coherent laser beam from a ran- 
domly inhomogeneous medium. For simplicity we take the medium to be 
isotropic. 

The scattering medium is considered as a scalar field, and the inhomo- 
geneity is described by fluctuations in the local dielectric constant e(r, f). 
This description is possible because the wavelength of optical radiation is 
long compared with interatomic spacings. The object of a scattering experi- 
ment is to determine the spatial and temporal correlations of this field. 
These may be written conveniently as 
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C(X4 12. X2q) = (S87 (x;) ... 587 (x,) 58*(x,4) ... de* (x5)>. (66) 


The question to be asked then, is how are these correlation functions related 
to the optical field correlations G™ of light scattered from them? The 
basic scattering problem was solved by Einstein,©® Brillouin” and Landau 
and Placzek.°)) 


Let us assume an incident field (taking a polarized component), 
Eo(r, t) = & exp [i(Ky .r—@ f)]. (67) 


We assume the scattering volume is > 4°, and that the dielectric constant 
contains fluctuations about a uniform value, ¢, 


(r,t) = &9+de(r, t) (68) 
The scattered field can then be written 
E,(r, t) = E,(r, t)+E,(r, t) (69) 


and Maxwell’s equations give in successive orders 


> Eq 0? 
V*Eo— 3 aa E =0 (70) 
Ey 0? 1 @ V(V . de Ep) 
VE, — 3535 = 3 Zz (6e E)— ——__, (71) 


0 


The solution at a point R in the far-field limit is found by a Fourier trans- 
form to be 

@o” Eo sing 
4nc? R 

It is linearly polarized in the plane containing k, and Eo. In this formula the 


direction of the point R determines the wave vector k, of the scattered light 
and hence also the scattering vector K which is given by 


E, (Ry, t) = o 


exp [i (k,.R—w, 1)] de(K, 1). (72) 


K = k,—kp. (73) 


@ is the angle between the polarization direction of E, and R. We have 
therefore, 


E*(R,, 1) « de* (K, ). (74) 


Thus a given scattering direction picks out a given spatial Fourier com- 
ponent of the perturbation of the dielectric constant. We form the corre- 
lation function 
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<E  (R,,, ty) E (R. ty) ewe E’ (R,,, ty) E* (Ry... b+ i) eee E*(Rx,,» ton)> 
oc <5e7 (K,, t;)de7 (Kz, t2) ...68~ (Ky, t,)58* (Ky 15 fp 41) ++ 68* (Kan ton): 
(75) 


This relates the correlation functions G” of the light field to the correlation 
functions C” of the medium (Bertoletti et a/.).“”? The first-order correlations 
are of the most direct interest, and in many cases can be calculated by trans- 
port theory. From the remarks made in the previous section it should be clear 
that Doppler spectroscopy offers great advantages here, since when feasible it 
gives exactly <de™ (k, t,) de* (k, t,)> for any arbitrary form of this function. 

We can see this by the following analysis. The electric field E(t) at the 
photocathode of the detector in a Doppler (heterodyne) experiment is made 
up of a reference beam 


E,(t) = &, exp (— ip t) (76) 
plus the scattered beam 
E(t) = E, (K, ¢). (77) 
Thus 
E(t) = E,(t)+E,(¢). (78) 


The measurable output of the photodetector, assuming that the scattered 
field is statistically stationary, is 


G(t) = <I(0) I(t)> 
= ((E, (0) +E, (0))(E, (+E, (DE. 
+E,* (t))(E,*(0)+E,*(0))>. (79) 
To simplify this expression we take the case where the complex dielectric 
constant fluctuations de(K, f) are Gaussian with zero mean. This assumption 


can be justified in many practical cases. 
In these circumstances the expression reduces to 


G22) = 1,2 421,660) +1, [6(a) exp (—iwot) +¢.c]+G,2(z). (80) 


When passed through a low-frequency reject filter, this signal gives the 
first-order field correlation function G,“(t), from which C,“(t) is immedi- 
ately available 


C(t) ~ <ée7 (K, t) de* (K, 0)). 


If the reference beam is removed, the correlation function G®(t), under 
the same statistical assumptions, has the same statistical properties as a 
thermal source and hence we may use the relation given above, 


G(r) = [EPO]? +E’, (81) 
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to deduce the first-order correlation from the auto correlation of the photon- 
counting fluctuations, or the intensity-fluctuation spectrum. I shall give 
examples of the use of this relation for such studies. 

Measurement of higher-order correlations, particularly temporal ones, are 
not difficult but have had little application as yet. They may possibly prove 
to be useful for studying scattering in critical phenomena, where Gaussian 
statistics may not hold. 


4,2 APPLICATIONS: INTENSITY-FLUCTUATION AND DOPPLER SPECTROSCOPY 


The great forté of the new methods in optical spectroscopy discussed in 
this chapter is that they allow spectral measurements to be made well beyond 
the limits of resolution of conventional optical instrumentation. For an 
optical interference effect to be used, we must have light paths differing by 
the order of a wavelength; if a resolution of 1 part in 10'*, say, is required 


cilnAcoluct ak =m 
ledrcalealod.. - 


See a ae eS 


Fic. 15. Scanning Fabry—Pérot oscillogram of a single-mode He—Ne 0:6328 um laser line. 
One cm on the screen equals 5-0 10-* cm~'. (After Jackson and Pike, J.Phys. E., 1, 394, 
1968). 


these paths would be about 10°m. Even allowing a gain of 100 for multiple 
reflection such path lengths are not practicable. In Fig. 15 is shown the 
spectrum of a single-mode laser obtained at R.R.E. with a 1m long Fabry- 
Pérot interferometer working at a finesse of 150. The line width is instru- 
mentally limited at 1 MHz and represents an optical resolution of only 
~ 2 in 10°. With scattered radiation even 200 MHz resolution is difficult to 
achieve. In Fig. 16 the Rayleigh and Brillouin lines of benzene are shown at 
approximately this resolution (see also Jackson et al.°*)). We see that if 
phenomena taking place on a time scale less than megacycles per second 
are to be studied by light scattering, using the intrinsically narrow (few Hz) 
lines from laser sources, then the techniques of statistical optics must be used. 
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Strictly we should not include Doppler spectroscopy as a statistical tech- 
nique since, although the intensity fluctuation of the mixed field is measured, 
the results are independent of the statistical properties of the source. We 
include it, however, as an example of the use of intensity-fluctuations. We 
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Fic. 16. Brillouin and Rayleigh spectrum of benzene by Fabry-—Pérot etalon with digital 
data storage, showing the decay times of acoustic photons and of entropy fluctuations. 


should now ask, conversely, what are the limits of intensity-fluctuation 
spectroscopy? Here one is limited solely by the response time of existing 
detectors. The fastest photomultiplier tubes available have pulse rise times 
something less than one nanosecond. Beat frequencies greater than a GHz 
will thus be averaged out. Thus there is a range of phenomena, having time 
dependencies between, say, 1 MHz and 1 GHz, which can be studied by 
either technique. We have already seen above, in the results of experiments 
on laser statistics, the study of fluctuations in the microsecond region by 
statistical spectroscopy. We have already mentioned the work of Forrester 
et al.) which was in the microwave region. The mercury discharge lamp 
has been the subject of study by statistical methods by a number of workers 
(Hanbury-Brown and Twiss,°?) Rebka and Pound, °*) Brannen et al.,° 
Twiss and Little,“°® Johnson et al.,°*°) Morgan and Mandel”), The width 
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of such a line (~200 MHz) lies at the high end of the frequency scale for 
statistical techniques and it is usually better to make such measurements 
using optical methods. The results of the last-quoted paper obtained by 
single-channel coincidence are shown in Fig. 17. In the penultimate reference 
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Fic. 17. Mercury arc lamp photon coincidences. (After Morgan and Mandel.‘°7?) 


the measurement was made by the photon-counting distribution. Measure- 
ments become progressively easier but less feasible by optical means as 

we come down in frequency. Before leaving the applications of these tech- 
niques to natural sources, however, I should remind you of the most famous 
and perhaps most important application in this category, which is the 
Narrabri stellar interferometer (Hanbury-Brown®®), This is now well in 
operation and can measure stellar diameters down to about 5 x 1074 
second of arc. 

In Fig. 18 is given an example of laser Doppler spectroscopy at 60 MHz. 
This is Brillouin scattering from an acousto-electric oscillator (phonon- 
maser). This Doppler laser-scattering application‘’”? has allowed a measure- 
ment of the soundwave flux inside the oscillator to be obtained, which is 
extremely difficult to achieve by any other method. 
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From Professor Cummins’ laboratory at John Hopkins University, | 
give an example” of the use of intensity-fluctuation spectroscopy in the 
study of transport parameters of CO, near its critical point. Typical results 
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Fic. 18. The Brillouin spectrum of an acousto-electric oscillator by Doppler spectroscopy. 


are shown in Fig. 19. This very beautiful work and similar work in Professor 
Benedek’s laboratory at M.I.T., particularly by Saxman,” has provided a 
great experimental move forward in the subject of the study of critical 
phenomena. The method used in general is to analyse the intensity-fluctua- 
tion spectrum of the Rayleigh scattered light using a wave analyser, although 
Cummins” has used an analogue correlator. I should say that a wealth of 
original experimental work in this new area of spectroscopy has come from 
these two laboratories. 

Another example where statistical spectroscopy has made an impact on a 
subject where optics previously was of limited application, has been in the 
study of turbulence (Goldstein et al.,’?) Pike et al.,{7?) Bourke et al.”). 
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In Fig. 20 is shown an experimental arrangement for the study of the struc- 
ture of turbulent flow of a fluid by intensity-fluctuation spectroscopy. The 
spectrum of the scattered light contains ‘“Doppler-shifted’” frequencies 
corresponding to the different velocities present in a “‘coherence volume”. 
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Fic. 19. Rayleigh linewidth along critical isochore for CO, by intensity-fluctuation 


spectroscopy. (After Swinney and Cummins, reproduced by permission of the American 
Physical Society from their Bulletin.) 
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Fic. 20. Apparatus for studying the structure of turbulent flow by intensity-fluctuation 
spectroscopy. 


170 E. R. PIKE 


This volume is controlled by geometrical factors and can be varied from 
small to large compared with the scale of the turbulence. Results obtained 
recently at R.R.E. are shown in Figs. 21 and 22. Interpretation of these shows 
that the velocity distribution is closely Gaussian; its width in the large- 
volume case agrees with previous determinations by hot-wire anemometry. 
For intermediate volumes, such work has shown that the scale of turbulence 
can be quantitatively measured. The method used was again spectrum analysis 
by wave analyser. 


VELOCITY DISTRIBUTION IN TURBULENT FLOW 


~-—DOPPLER SPECTRUM 


pee Salt tant a 
0 75 I5OKHz 


INTENSITY FLUCTUATION SPECTRUM——> 


pees 


0 75 ISOKH2 


Fic. 21. Doppler and intensity-fluctuation spectra of turbulent flow. 


My last example’) gives another illustration of the power of the method 
of single clipped autocorrelation of photon-counting fluctuations, known 
locally as SCAC. It is here applied to the determination of translational 
diffusion coefficients, D;, of protein molecules in aqueous solution, a subject 
which has been pioneered in the laboratories of Professors Cummins and 
Benedek. The apparatus is shown in Fig. 23. The diffusion constant controls 
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the Brownian motion which can be measured by determining the optical 
spectrum of scattered laser light. The theoretical prediction for a protein 
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Fic. 22. Linewidths of intensity-fluctuation and Doppler spectra of turbulent flow as a 
function of coherence volume. (After Bourke ef al.””) 
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Fic. 23. Apparatus for measuring the single-clipped autocorrelation of photon-counting 
fluctuations. (After Foord et al.*) 
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small compared with the optical wavelength is a single exponential with 
decay time 


Tt, = 4D;|K)’, (82) 


where K is the scattering vector. 

A typical result of a SCAC experiment is given in Fig. 24 for the protein 
bovine serum albumin. About 40 mW of He—Ne multimode laser light was 
focussed in the sample cell where a concentration of 1 mg/ml of the protein 
was suspended in a universal buffer solution of controlled pH. Light was 
collected at a scattering angle of approximately 30° from about 10% of a 
coherence area. The total photoelectron counting rate was 200 counts per 
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Fic. 24. The second-order correlation function, g™(z) of laser light scattered from the 
protein bovine serum albumin obtained by the method of single-clipped autocorrelation 
of photon counting fluctuations. (After Foord et al.“7*) 


second. An accuracy of 2% can be obtained in an experiment lasting one 
hour. The important difference made possible by the SCAC method over 
the wave analyser is that for a given experimental time, the concentration 
of the protein can be reduced in the former case by the ratio of the band- 
width of the wave analyser to the frequency range scanned. Nearly two 
orders of magnitude can be gained in this way. Since the diffusion constant 
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is dependent on concentration, this is a very important consideration. In 
Fig. 25 are shown some preliminary results we have obtained by varying the 
pH of the buffer solution. The protein can be seen to “uncurl”’, that is, its 
diffusion coefficient increases as the pH is reduced. Experiments along these 
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Fic. 25. The effect of changing pH on the diffusion constant (and hence size) of the protein 
bovine serum albumin using the apparatus of Fig. 23. 


lines have aroused the interest of a number of biologists; the relative speed 
and accuracy of the measurements open up new possibilities in the study of 
time-dependent processes which will no doubt prove rewarding in the 
future. 
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Quantum-Classical Correspondence for 
Stochastic Processes 


W. H. LOulIseLi 


The statistical properties of the radiation field in a laser and optical para- 
metric oscillator have been important in the understanding of these devices. 
Recently, the techniques developed to study the laser have been applied with 
gratifying success to the study of the noise properties of Josephson junction 
oscillators. Some of these methods are presented in this chapter. It is believed 
they will prove very useful in the future, in other areas of physics. 

Section 1 contains a brief review of several topics in quantum mechanics.“ 
In Section 2 the quantum theory of damping under the Markoff approxima- 
tion is presented. The concept of a reduced density operator is introduced 
which describes the statistical properties of a system coupled to a damping 
mechanism. This is applied to a damped mode of the radiation field. 

The concept of ordered operators as a means of developing a quantum- 
classical correspondence is presented in Section 3. This allows us to treat 
quantum stochastic processes in an equivalent classical domain. A quantum 
regression theorem due to Lax is also derived which allows us to convert 
multi-time averages to one-time averages for Markoffian systems.?* The 
spectra of the radiation field can then be determined. 

In the final section these techniques are applied to a rotating wave van der 
Pol oscillator. A single mode homogeneously broadened laser has been shown 
to be equivalent to such an oscillator. 


1. REVIEW OF QUANTUM MECHANICS 


1.1. EQUATIONS OF MOTION 
a. The Schrodinger Equation 


The Schrédinger equation in Dirac notation is 


0 t 
in EO” _ HMO», (1) 
where the Hamiltonian is Hermitian 


H = Ht, (2) 
177 
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When H is time independent, the solution of (1) is 


|W(t)> = exp (—7 Ht/h) |W). (3) 
This state vector is normalized to unity so that 
Wt) |W(t)> = 1. (4) 


b. Heisenberg Equation of Motion 


If py, is the probability of the system being in state yy and M is any system 
operator, the mean value of M is defined by 


<M(t)> = 2 Py w(t) | M | W(t)> 
= dP <W(0) | exp @Ht/h) M exp (—iHt/h) |y(0)>, (5) 


Y Py = 1. (6) 
m7 


where 


Here, <w(t)| M|w(t)> is a quantum ensemble average when the system is 
known to be in state w, while the second average over states w is due to the 
classical like uncertainty in the state w. If we define the Heisenberg operator 
as 

M,(t) = exp GHt/h) M exp (—iHt/h) (7) 
then 

<M(t)> = 2 Py <W(0)| M(t) |W (O)>. (8) 


From (7) we see that 


dM, (t) 


ih ae 


= exp (iHt/h) M exp (—iHt/h) H —H exp (iHt/h) M exp (—iHt/h) 


= [M,(¢), H] (9) 
which is the Heisenberg equation of motion for the operator in the Heisen- 


berg picture. 


c. Density Operator Equation of Motion 
We have the relation 


Tr|u><v| = <v| 4). (10) 
To prove this we may use any complete set of states such that 
Yi In><n| = 1. (11) 


Then 
Tr |u><v| = }) <alud<v|[ n> = V <v|n><n| ud = <v| ud. 
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This result allows us to re-write (5) as 


<M(> = Te {ME polvCor<veo)}. (12) 
We then define the density operator p(t) in the Schrédinger picture by 
p(t) = LPylWOrvO!, (13) 
so that 
<M(t)> = Tr M p(t). (14) 


M and p(t) are both in the Schrédinger picture. 
If we use (1) and its adjoint, it follows from (13) that 


20.» [nO 


it) |+ih |W) & = wo} 
=F HVOOVO! ~ WOXHOLH) 


= [H, p(t)] (15) 


which is the equation obeyed by the density operator. Again if H is time inde- 
pendent, the solution is 


p(t) = exp (—iHt/h) p(0) exp (GHt/h) = 2 Py YOO. (16) 


Therefore, by (3) and its adjoint we see that 


p(0) = d Py | ¥(0)><p(0)I. (17) 
Furthermore, by (10), (6) and (4) we see that 
Tr p(t) = 2 Py WOW) = 2 Py = 1. (18) 
We note next that 
Tr AB = Tr BA. (19) 


This follows if we use (11): 
Tr AB = ¥' (n| AB|n> = > <n] A| m> <m| BI nd 


=} <m|B|n><n| A|m> = Y <m| BA|m) 


=4Ir BA 
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By (14), (16) and (19), we see that 


<M(t)> = TrM p(t) = Tr M exp (—iHt/h) p(0) exp (iH t/h) 
= Tr exp (iHt/h) M exp (—iHt/h) p(0) = Tr My (t) p(0). (20) 


Therefore, traces may be evaluated in the Schrédinger or Heisenberg pictures. 


1.2. THE RADIATION FIELD 
a. Quantization 


The radiation field without sources may be described classically in mks units 
by the vector potential A(r, tf) which obeys the wave equation 


1@A 


2 shheaee 
V*AC(r, t) = “3 Bp (r, t). (21) 
The electric and magnetic fields are 
—0A 
,t) = —— 22 
E(r, t) = (22) 


B(r, t) = curl A. 


We may expand A in the form 


1 
A(r, t) = ae ) u(r), (23) 
where 2 
d 
= + oP q, =0 (24) 
v2 U, (r) + (2!) a (r) = 0. (25) 


When we solve (25) in a cavity with perfectly conducting walls, we obtain a 
set of normal modes. The g,(t) give the amplitude of the /th mode and obey 
the equation of motion of a simple harmonic oscillator. If we define 


dq, 
A(t) =P (26) 
then the electric and magnetic fields become 


E(r,1) = a3 Tact). (27) 
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Bar, t) = + d gq, (t) curl u; (r). (28) 


so that p,(t) determines the electric field and q,(t) the magnetic field. 
The normal modes of (25) are orthogonal and normalized to unity: 


{ u,* (r). u(t) dr = yyy (29) 


The energy of the field in the cavity is 


£0 n2 1 
H= —E t)+—B*(r,t)| d 
{ 2 ef) 210 |e 
cavity 


= py 4(p,? + @," 4,’), (30) 


which follows from (27-29). We may therefore visualize the electromagnetic 
field energy as residing in an infinite set of fictitious harmonic oscillators, one 
for each mode. 

To quantize the field, we merely quantize the oscillators. We assume p, and 
q, are Hermitian operators which obey the commutation relations, 


[41> Pm) = thd tm (31) 
If we introduce the linear combinations 
1 h 
a= Tong Oat iP) a= |(=)@+ 
or (32) 
at = a (0 - 10) pai l()@—-a 
for each mode, it follows from (31) that 
(4; Gm!) = Sime (33) 
The Hamiltonian operator (30) then reduces to 
H= d hon (a, a,+4). (34) 


b. Eigenvalues and Eigenvectors. 


If we restrict ourselves for simplicity to one mode, the Schrédinger equation 
(1) becomes 
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0 
ih— | W(t)> = ho(atat4)|W(n>. (35) 


Since the Hamiltonian is time independent, we may separate variables. Thus, 
if we let 


[W(t)> = exp @Et/h) |W0)>, (36) 
we obtain the eigenvalue equation 
haat a+4)|y(0)> = E|w0)>, (37) 


where E is the energy eigenvalue. 
Since (at a)' = ata, this operator is Hermitian and has real eigenvalues. 


We may write this as 
ata|ln’> =n'|n'), (38) 


where we label the eigenvector by the corresponding eigenvalue. 
To solve, we note by (33) that 


[a,ata] =a [at, ata] = —at (39a) 
or 
aaa=a(ata—1); ataat=al(ata+1). (39b) 


If we use these to operate on |n’>, we have by (38) that 
at af{a|n'>} = (n’—1) {a|n’D}, (40) 
at a{at | n'y} = (n’ +1) {at |n’y}. (41) 


Therefore, if |’) is an eigenvector of ata with eigenvalue n’, then a|n’) is also 
an eigenvector of a‘a with eigenvalue n’ — 1 while a‘ |n’> is an eigenvector with 
eigenvalue n’+1. We may therefore proceed to generate an infinite set of 
eigenvectors and eigenvalues from the assumed original one: 


|n'> a|n'> a’ |n'> 
n’ n’—1 n’—2 

|’ a‘ |n'> a’? |n'> 
n’ n+] n’+2 


We next show that n’ = 0, 1, 2, .... The length of a vector must be positive. 


Therefore, 
<n'|n'> > 0. 


If the equality holds, then | ’> = 0 which is uninteresting and trivial. There- 
fore, <n’ |n'> > 0. Next, the length of a| n’> is . 


<n’ |ata|n’> = n'<n'|n’> > 0, 
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where we again used (38) and the fact that (a|n'>)' = <n’ |at. Since (n'|n'> >0, 
then n’ > 0. Therefore, all the real eigenvalues n’ must be non-negative. The 
generated sequence n’,n’—1,n’—2,... must be terminated before reaching 
negative values. Therefore, n’ must take values 0, 1, 2, .... 

To normalize the vectors, we write 


alm) =c,|n—1). 
Then 
<n|ata|n> = |c,|? <a—1|n—1> = nXn|n). 


If we require that 
<n|[n> = 1 
<n—1|n—1> = 1, 


then aside from a trivial phase, c, = ./n and we have 


a|n> = /n|n—1> (n = 0, 1,25...) (42) 
In a similar way, we see that 
a'|n> = /(n+1)|n4+1 (43) 
and 
ata|n> =n|n> (44) 
and the energy eigenvalues are 
E, = ho(n+3). (45) 


It follows from (43) by repeated application of at to the vacuum or ground 
state | 0> that 


at 
Vit 


If the field is in the energy eigenstate | > with energy E, = hw(n+4), we 
may say that the cavity has 7 units of excitation hw. Each unit of excitation 
may be called a quantum or photon and then one says there are exactly n 
photons in the cavity of energy haw. When a operates on | 2), we have the state 
|—~ 1} with one less photon. We say that a “‘annihilates” a photon. Similarly, 
since at|n> = ./(n+1)|n+1), at “creates” a photon. 

It is not difficult to show that the photon eigenstates are orthogonal 


<n|m> = Om (47) 


[0> = |D. (46) 


and complete 
2, np<n| = 1 (48) 


where | is the identity operator. 
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c. Heisenberg Equations of Motion 


The Heisenberg equation of motion for a(t) is 
. da 
ih are fa, H] = ho[a, at a] = hoa, (49) 


where we used (34) and (39a). The solution is 
a(t) = exp (—iwt) a(0) = exp (iwtata) a exp(~iwtata), (50) 
where the last form follows from (7). Similarly, one finds that 


a'(t) = exp (iwt) at(0) = exp (iwta‘a) at exp (—iwtata). (51) 


2. QUANTUM THEORY OF DAMPING 
2.1. THE REDUCED DENSITY OPERATOR UNDER MARKOFF APPROXIMATION 


a. Arbitrary System 


Consider a system described by a Hamiltonian, H, a reservoir described by a 
Hamiltonian, R, and a weak interaction between the system and reservoir V. 
The density operator for the system and reservoir, p(t), in the Schrédinger 
picture satisfies the equation 


op 1 
—— = —[H+R+YV, p]. 52 
aR [H+R+YV, p] (52) 
In general, we are interested in mean values of a function, M, of the system 
operators only. This is given in the Schrédinger picture by 


<M(t)> = Tras M p(t) = Trs[M Trp p(t)], (53) 


where we have traced over both the system and reservoir. From this we con- 
clude that we never need all the information contained in p(t), but that we 
need only the reduced density operator for the system defined by 


S(t) = Tre p(t), (54) 


where we trace p(t) over the reservoir. S(t) is an operator function of system 
operators only. We would therefore like to remove the unnecessary informa- 
tion in (52) and obtain an equation of motion for S(t) directly. 

To proceed, it is most convenient to transform (52) into the interaction 
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picture. Physically, this removes the high frequency unperturbed motion from 
the equation of motion. To this end we let 


p(t) = exp = (H +Ry] x(t) exp Fe +R). (55) 


This represents a transformation from the Schrédinger picture, p(t), to the 
interaction picture, y(t). Since the system and reservoir are independent before 
coupling, we note that [H, R] = 0. In fact, all system operators commute with 
all reservoir operators. 

If we substitute (55) into (52), we find that x(f) satisfies the equation 


ae ee 
ih = [V(t), x] (56) 
where i a3 
V(t) = exp % (H +Ry| Vexp | 7 (H +R). (57) 


If we trace both sides of (55) over the reservoir and use (54) we have 


S(t) = Trp exp |; a+r x(t) exp Figarcu 


= exp (=H) s(t) exp (= it), (58) 


where 
s(t) = Trp x(t) (59) 


is the reduced system density operator in the interaction picture. 

We assume that the interaction is turned on at t = 0. At that time, the 
system and reservoir are independent, so that the density operator factorizes 
into the direct product 


p(0) = S(0) fo (R) = (0) fo (R) = xO), (60) 


where S(0) is the initial system density operator and fo(R) is the initial reser- 
voir density operator. The reservoir is assumed to be in thermal equilibrium 
at temperature Tso that 


exp (— BR) 
Trp exp (— BR)’ 
where B = (kT)~1, a Boltzmann distribution. In addition, we assume that the 


reservoir is so large that its statistical properties are unaffected by the weak 
coupling to the system. In other words, we assume the reservoir remains in 


fo(R) = (61) 
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equilibrium when it is coupled to the system. The assumption that the inter- 
action is “turned on” at ¢ = 0 is just an artifice to remove all statistical cor- 
relations between the system and reservoir initially. 

We return to (56) and integrate both sides from ¢ = 0 to f¢ subject to the 
initial value (60). This gives on two iterations 


1 : / , 
UO) =10)+ = | IVE). xO) a 


rn (4) i ae f dr” ve, [vie”), xe"). (62) 


If we proceeded to iterate in this way, we would obtain y(t) in a power series 
in the perturbation V(t) which is just a perturbation result. Unfortunately, 
perturbation theory is not adequate to yield exponential decay and we resort 
to another approximation. If we differentiate both sides of (62) with respect 
to t, we have 


= = [V(x] + (=) i [Vo.cve", xe"y)| ae”. (63) 


If we trace both sides over the reservoir and use (59) we obtain 
Os 1 \7f¢! ‘ , 
Sree) [ot [ro.c7e9.c01] ar, (64) 
ot ih 0 


where we have assumed that 
Trp V(t) x(0) = Tre x(0) V(t) = 0 


and (0) = S(0)f,(R). That is, we assume that the interaction has no diagonal 
elements in the representation in which R is diagonal. If V does have diagonal 
terms, we could re-define the system Hamiltonian to include them, so this is 
no real specialization. 

At this point we assume that V<H or R. If V = 0, the system and reservoir 
are independent and the density operator y(t) would factorize into the direct 
product x(t) = sf) where we assume the reservoir remains in equilibrium. 
Since V is small, we therefore look for a solution of (64) of the form 


X(t) = s(t) fo(R) + Ax(e), (65) 
such that Ay is small, of order V. In order that (59) be satisfied, we require 


Trp Ay(t) = 0, (66) 
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so that 
s(t) = Trp x(t), 


If we put (65) into the integrand of (64) and retain only terms of order V? 
on the right, we have 


o —-l ft 
#eF { ; Tr | VO, (vie), s(¢9 fo(®)1| dt’. (67) 


The reduced density operator s(t) determines the statistical properties of the 
system. Since s(t’) occurs in the integrand, the behaviour of the system in the 
future depends on its past history from t = 0 to ¢. We next make the assump- 
tion that the system is Markoffian which means that its future behaviour is 
determined by the present and not the past. This assumption is valid, since 
damping destroys knowledge of the past. Mathematically, we therefore replace 
s(t’) by its present value s(t) so that (67) reduces to 


= = = i Tre| VO, [Vi(t’), Of) dt’ (68) 


0 


which is called the master equation. 

The Markoffian approximation has been done very crudely for the sake of 
brevity. Actually, one should take a coarse grained average on a time scale 
which is long compared with the reservoir correlation time but short compar- 
ed with the system damping time. It should also be long enough to contain 
many cycles of the undamped system motion. We may then let the upper limit 
in (68) go to infinity. 

Next we assume the interaction energy between system and reservoir can 
be written in the form 


V = S: F i 0;, (69) 
where the F; are reservoir operators and the Q; are system operators. In the 
interaction picture these may be written as 

V(t) = D) Fi(t)Q; exp (ia; t) (70) 
When this is substituted into (68) and the double commutator is expanded, we 


find after minor algebraic manipulations that 


0 
= = 2 /0(@y-@)) {L0; 0;s—O;s Q,]wj ;-[Q;50,—s Q; Q1wj i}, a 
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where we have let 


wiy= [exp (—io, t) Trp Lfo(R) Fit) Fj dt 


(72) 
Wii= [. exp (— io, t) Tra fo(R) F; Fi(t)] dt. 


This master equation describes the statistical behaviour of a system coupled 
to a reservoir under the Markoff approximation. 


b. Driven Damped Oscillator 


Let us apply the above analysis to obtain the master equation for the reduced 
density operator in the Schrédinger picture for a driven damped mode of the 
radiation field in a cavity. The unperturbed Hamiltonian is 


i 
where @, is the cavity frequency. 
We take the interaction energy to be 
V=hY.(k,b,at + «;,* 6, a) (74) 
t 


where x; describes the scattering process. 
In the notation of (70), we have 


Q, =a" Q,=a=Q,' 
(75) 
Fi, => Kb, F, =) x,*5,' = F,'. 
q t 
In the interaction picture we easily see that 
T = al exp [iw,(t—to)] =@Q," 
Qi [iw,(t— to 2 (76) 


F,! = by Ky b, exp [—iw,(t—to)] = F,™. 
t 
Our first task is to compute the reservoir spectral densities w7, and w7j;. 
We shall calculate one in detail and leave the others as an exercise. We have 
by (72) and (76), 
foo) 
wi, = [exp Goze) F.@) Fidade (77) 


In the notation of (72), we see that 
D2 = —We. (78) 
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Also by (75) 
(F2(t) Fi>r = (y K;* exp (i; T) b,t 3 Km bn) 


(79) 
= Y K,* K» Exp (ia, T) {bb a> 
im 
But 
Trp exp (— BR) b,'5,, 
b,t a ich Me ae. 80 
< i bm>R Tr exp (— BR) ( ) 
where R = )' hw, b, b;. The reader may show easily that 
j 
<6,' ba>R = O1m Ay (81a) 
where 1 
i, = —_———__———_ 81b 
1 = exp (ho,/kT)—1 (St) 
Therefore, (77) becomes on using these results 
wh, = Db? m |” exp Ci(o,— cr] de, (82) 
fi 0 


where we have interchanged the order of summation and integration. Let 
g(w,) dw, be the number of reservoir modes between , and w,+dq,, then 
since we assume that these modes are closely spaced, we may change the sum 
to an integral 


» {jo {, dan, g(@,) { }, (83) 
so that (82) becomes 
wi, = | do, 9(@) |x(op? m1, (84a) 
where we let 
fe { exp [i(co,—0,}t] dr. (84b) 
Now 
iz exp (+ iz) dr = 28(Q) + iP = (85) 
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where # is the Cauchy principle part defined by 


Loe eo aay © 


Therefore, (84a) becomes 


0 1 
wh = | de, g(a) [m(a)? Hw) { x5(o,-@,) + 19 } 
0 @,;— O-, 
ms (z ” iso) (87) 
2 > 
where we have let 
y = 2ng(@,)|K(@,) |? (88) 
bio = — a [* Serato 
0 @,— @, 
and 1 
nt eo (89) 


exp (hw,/kT)—1° 


The imaginary part of wy, represents a small shift in the cavity frequency due 
to the coupling to the reservoir and in general may be neglected. 
The remaining w’s may be found by the same argument and are given by 


wi, = (t — ido) =W3i 


(4 + ido) (1+) = wi (90) 


Wh, 
wi, => wi, = 0. 
If we use these and (75) and (76) in (71), we find after minor algebra that 
os 


—— = -io,+ Aw) [a'a, S] + 4 [2aSat—ataS—Sata] 


+ yn [at Sa+aSat —ataS — Saat‘. (91) 


We see therefore, that the only effect of Aw is to change slightly the cavity 
resonant frequency w,. Aw is proportional to |x|? by (88), which is the small 
coupling coefficient. We may re-define w,+ Aq@ as w,’ or neglect it. All opera- 
tors in (91) are in the Schrédinger picture. 


QUANTUM-CLASSICAL CORRESPONDENCE FOR STOCHASTIC PROCESSES 191 


The first term in (91) represents the free motion of the cavity mode. The y 
terms represent the loss of energy from the system to the reservoir, while the 
yn terms represent the diffusion of fluctuations in the reservoir into the system 
mode. Note by (89) that as the reservoir approaches absolute zero, ni — 0. 

We may transform (91) to the interaction picture. If we let 


H’ = h(w,+ Aw) ata = ho,’ a‘a, (92) 
then (91) becomes in the interaction picture, 


os 

ee + = [2asat —atas—sata]+yn[a'tsa+asat —atas—saa*), (93) 

where all operators are in the interaction picture. We omitted the super- 

script I in the y and yf terms for simplicity since all these terms contain both 

an a and an at and we note that aa! = aal and no confusion is likely to arise. 
The master equation (93) may be solved by various techniques. 


3. QUANTUM-CLASSICAL CORRESPONDENCE”? 


The essential mathematical difference between quantum and classical mech- 
anics is the algebra obeyed. In quantum mechanics the variables do not neces- 
sarily commute and the order in which they are written down is important. By 
introducing the concept of ordered operators, we may set up a correspondence 
between the quantum system and a “‘classical’’ equivalent system. This allows 
us to separate the stochastic problem from the quantum problem. 


3.1. ORDERED OPERATORS—DISTRIBUTION FUNCTION 


Let us consider a complete set of non-commuting operators a,, a2, ..., dy in 
the Schrédinger picture which obey some set of commutation or anticom-~ 
mutation relations. The anticommutator of two operators is defined by 
[a;,a,]4 = a,a;+a, a; Suppose we have some function Q of these operators 
which may be expanded in a power series. We may use the commutation (or 
anticommutation) relations to re-order the terms in the function into some 
pre-determined or chosen order. Let this chosen order be aj, a2, a3, ..., Gy. 
Therefore, Q in chosen order becomes 


Q = O(a, asey ay) = 63 ares yy O*., coe OS a," a,"* eee a,” (94) 


rt ry 


We put a superscript c to indicate that we have put the function in chosen 
order which of course is equal to the function in the original order. Once we 


192 W. H. LOUISELL 


have put Q in chosen order, we may define an associated c-number function 
by means of 

Q° (chy, Ory eens Oy) = De eee Dr, ray onan rp a 2”? oe OZ", (95) 

r1 ry 
where we replace the operator a; by the c-number a; which is real or complex 
depending on whether a, is Hermitian or not. We put a bar to remind us we 
are now dealing with a quasi-classical or c-number function. 
We next define a linear chosen ordering operator @ by means of 


O° (ay, da, «-.5 4p) = C{O"(ae, «pf, (96) 


where @ tells us to replace each a; by the corresponding operator and write all 
terms in chosen order. We may also define the inverse operator ¢~?. 
We may give a formal representation of the @ operator by means of 


f 
LOS (Oy, 005 %p)} = | {OG veey Ey) IT 6@,—a) da;, (97) 


where the 5-functions are operators in the chosen order. In the integration we 
formally replace each a, by the corresponding a; and put all terms in the chosen 
order. But since this is exactly what ¢ tells us to do, the two expressions in (97) 
are formally equivalent. 

If the operator a, is Hermitian and the a, real, we may represent the 6- 
function by means of 


§(a—a) = — i exp [—i&(a—a)] dé, (98) 


while if it is non-Hermitian, the operator and its adjoint will both be present 
and we then let 


Hua) 5(a*—aty = | [ exp [-i(e—a)] exp [~#*@"—aly] 6, 9) 


if the chosen order is a, at. Here we integrate over the entire complex €-plane 
and d7é = d(Re €) d(Im 6). In this formal presentation we shall use (98) for 
simplicity. 

If we use (98), (97) may be rewritten as 


0°(a,,...,4,/) = ( : )’f-- fan 1+ Wop D°(a1, «005 &) 


2n 


x | wf dbs ... dé, exp [—i6 (a, —a,)] ... exp [—i€ (a, —ay). (100) 
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If we interchange the order of integration of the ¢’s and «’s, we may write this 
as 


O(a ay) = | -- [ abs dE exp (Eras) xP (pa) FE 89) 
(101a) 


where 


F(Eq, -003 65) = (=-) J wf da ... da, exp [—i(10, +... +85 ay) 
an x OF (a4, ...5%,). (101b) 


Therefore, (F£,,...,&,) is just the Fourier transform of the associated c- 
number function Q°(a,,...,%,). If we invert (101b) by taking the inverse 
Fourier transform, we obtain 


Fass nytt) = [exp Lira tn Ep a9] FlErs bs) As 
(101c) 


which gives us the associated function in terms of F. If we apply the @ operator 
to both sides of (10ic), we obtain (101a) exactly. 
Consider next the expectation value of Q 
<Q°(a,, ooeo as, t)> = Tr p(t) Q°(a,, sees a,). (102) 
If we use (101a), we have 
Oe sap t)y = fon [br dy CR nko DF Enns (103) 
where 
CE, «+ Sy, t) = Cexp (iC 1a, )... exp (ies a;)> 
= Tr [p(t) exp (i ,4;) ... exp (, a,)] (104) 
is a characteristic function and F(é,,...€,) is given by (101b) and is the 
Fourier transform of the associated function Q°(a,,...,a,). In (103) we 
have converted the quantum expectation value to an integral over c-number 


functions. We may express this in another way as follows. By (97) we have an 
alternate expression equivalent to (103) 


CQO°(Qy, -.5 Ap, t)> = | wf da vos Mot Doha, oy Xp) PoCO1, «2% prt), (105) 
where we have let 
P (04, --05%7,0) = (6(H1 — Gy) ... (yp —ay)) 
= Tr p(t)d(a,—a,) ... (a,—ay,) 


1 \f j 
™ (=) [fa 1 Ay exp [iE ay +... +05 ay) ee 
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and where we have used (98) and (104). We call P a distribution function which 
is the Fourier transform of the characteristic function. If we apply the inverse 
Fourier transform to (106), we have 


CE 1, Ey, t) - {fa ve Ah EXP (IC 0, +... +85 op) ] PCy, ..., &, £) 
= Trexp (i¢,a,) ... exp (iS ay) p(ay, ..., Ay, t). (107) 


The advantage of using the distribution function stems from the fact that 
expectation values of operators in chosen order are evaluated just like clas- 
sical averages if P, is considered as a classical probability. We may see this as 
follows. We have by (107) 


om trat.ury C 


OSs)" OCS)? ... OCS 


= (a,"a,"... a7’ 


é; = 0 


= eee ow," a2"? cee a? P(a4, sang Oy, t) da, eee de, (108) 


There will be corrections to this quantum-classical correspondence for mean 
values of operators which are not in chosen order. For example. 


{ay a,» F iB Ay X P.(%,; Oo, t) de, da, a <a, a2», (109) 


if a,, a2 is the chosen order. 


3.2, EQUATION OF MOTION FOR DISTRIBUTION sTuDy®) 


We may next use the above association scheme in order to transform the 
master operator equation (71) into an equation of motion for the c-number 
distribution function. From the definition (106), we have 


oP. oS .. 
a (a,¢) = Tr a? (a—a), (110) 
where we let Sf 
6°(a—a) => I] O(a; — a;). (111) 
i=1 


If we use (71) in the Schrédinger picture and the cyclic property of traces, 
(110) reduces to 


OP. 
ot 


=TrS (- [o*(a—a), H] 


> 6(@;,— ;) {[d°(o —a), Q;] Q; Wii —Q;[6°(a—a), Q;1w;;} ~ (112) 
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We may represent the 6-function as 


0 0 
5°(a—a) = ITexp (=a =] 5(a;) = exp |-a=| 5(a) (113) 
and (112) becomes 


aP, 1 a 
a =TrS [i [exe (-» Ge) ]-F Seon i) 


x | [exp (-»=), 0,| Q;wi;—Q; [exp (- a<).0,| win Ko 


=TrSL’ [“ a] 5(a), (114) 


Oa’ 


where LV stands for the operator argument of the trace which is a function of 
the system operators a,,...,@,. The Q; and Q; are usually one of these, or 
functions of them. Since L is a function of the a’s, we may in principle use the 
commutation relations to put it into the chosen c-order and 


’ 0 te g _ Tre a c 
u (sa) <2 (< a) =[Le(Z.8)ee-9ap, ais 
where we have used (97) to express L. Now by inspection of L’ in (114), we 


see that after putting it in chosen order, the exponentials in L’*{0/da, B} can 
be factorized out of all terms, so we may write it as 


bas (<8) aw by (=. 6) exp (- x). (116) 
When we use this and (115), (114) becomes 
oe (a,t)=TrS [r (= p| exp (-p =) 5(a) 5° (B—a) dp. (117) 


However, 3 
exp (—B =) 5) = 56-2, (118) 
Oa. 
so that (117) becomes 


oP. 7. ( 0 
a (@1)= Ts {L (<8) 50-2) (B—a) dB 


= [2°(.8) 30-9 P.6,0¢8, (119) 
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where we used the definition (106) for P,. If we carry out the B integration, we 
obtain 


t) = Ls (<2) P.(a, t). (120) 


This is the formal equation obeyed by the distribution function which we 
wished to obtain. In many cases of interest, in particular for a homogeneously 
broadened single mode laser, we may neglect derivatives higher than second 
order in (120) and write it in the form 


= = ft) P+ - = 2y(@) Puen, (21) 


where we sum i and jfrom 1 to f. This is called a Fokker—Planck equation and 
describes the statistical behaviour of an associated classical stochastic process. 
The »/,’s form the components of a drift vector which give the mean motion 
of a; as we shall show next. We have by (105) 


d<a;> _ 
dt 


os t) a; da. (122) 


If we put (121) into this and integrate by parts once on the first term and by 
parts twice on the second, we obtain 


f oe -{ of, (0, t) P,(, t) der, (123) 


since the integrated parts vanish. 

The terms containing 9,; represent diffusion. The 2;,’s are called diffusion 
coefficients and form the elements of a diffusion matrix. Consider two opera- 
tors a; a; and assume ij is in chosen order. 


he. a; da. (124) 


ot 


We then have d 
wasn] 


If we again use (121) and integrate by parts, we obtain 
d 
au P(e, t) da {a; Sj; +0; S;+29,;}, (125) 


which shows how the diffusion affects the motion. 
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3.3. TWO-TIME AVERAGES. SPECTRA‘? 


A complete statistical description requires mean values of operators at 
different times. For example, in the study of the statistical properties of a mode 
of a radiation field, the fluctuation spectrum is defined by 


{ exp (~ iat) <at(t) a(0)> di. (126) 
The intensity spectrum is given by 


{ “ exp (— iat) <at(0) at(t) a(t) a(0)> dt, (127) 


while the photon number spectrum is 
| exp (— ict) <at(t) a(t) a'(0) a(0)> dt. (128) 


These are directly related to experimentally observable quantities which the 
theory must be capable of predicting. In the interests of brevity we shall 
merely give the results of such averages. 

Let P,(ct | B, 0) be the solution of the equation of motion for the distribution 
function subject to the initial condition that «; = 8;. Then it can be shown that 


<M(t)N(O)> = | da [ dB M°(c) P.(a, t| B,0) Tr[S(0)5°(B—a) N(O)], (129) 


where M and N are two-system operators and S(O) is the initial reduced 
system density operator. 


4. ROTATING WAVE VAN DER POL OSCILLATOR 


It has been shown™ that a single mode homogeneously broadened laser satis- 
fies the Fokker—Planck equation for a rotating wave van der Pol oscillator 
given by 


oP 4] é 
a ae, t) faa Ba (g—iw,— |a|7)oP,— 5q8 9 time | ayes 
oP 
4—— 130 
+ anba*? a) 
where g is a numerical pumping parameter and the equation has been scaled 
in the time and « variables. The chosen order is normal order. It describes a 
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radiation field mode which is damped for g <0 and starts to grow exponenti- 
ally when g>0. g=0 is called the oscillation threshold. For g>0 the oscilla- 
tion builds up but because of the nonlinear terms in |«|?« and |a|?a*, the oscil- 
lations will stabilize at a fixed amplitude, since g —|«|? eventually goes to zero 
as |a|* increases. 

Because of the |a|* terms, we cannot solve (130) analytically. However, 
there are two interesting regimes where we may obtain analytical solutions, 
viz., far below threshold (|«|” <|g}) and far above threshold (|«|? > |g). 


(a) Far Below Threshold (\a|? < \g|) 


In this region (130) reduces to 


oP rs) 0 6?P 
ay eee ay pike 
Ot Oa (gaP) Oa* Grey a4 Oada* * Gs) 


The reader may easily verify that the Green’s function solution is 


1 = —fia —~ 2 
P(%#1B,0) = yr exp | — S—Ferpe mea), 3a) 
where y) 
c(t) = ma [exp (2gt)—1] (133) 


and g <0 (below threshold). We assume at t=0 the system is in a steady state 
given by the Gaussian 


_ Igl lal, p2 
P(B,0) = exp (~ “Pipi? (134) 
The fluctuation spectrum below threshold is seen to be Lorentzian with half 
width |g| and is given by 


~ 4 
[Eto a(0)> exp (i@t) dt = (o,—0)? +92 (135) 
(b) Far Above Threshold \a|* > |g| 


To study the Fokker—Planck equation above threshold, it is best to change to 
polar coordinates and let 


a = rexp[i(P—@,t)]d?a = rdrd@. (136) 
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Then (130) becomes 

oP 1 oO oP 1 oP 1 a ae 

— = — —} r— — ~_ P 137 

ot r a +) Lg~r yr’ P] 3?) 


r? 6g? or ‘Or 
and P is normalized so that 


2a (oe) 
. | ag | P(r, ¢, t)rdr = 1. (138) 
0 0) 
In the steady state, P is independent of ¢ and the d-independent solution is 
_ 2_ 7\2 
| exp (—x7) dx 
2 


—@! 


P(r) = 


The mean number of photons in the steady state is given by 


<ata>, = | r? P(r) rdr. (140) 
0 
Far above threshold we may let the lower limit in (139) go to — oo so that 
<ata>, © g. (141) 
To study the fluctuations about this steady state we let 
r= J/g +7,(¢), (142) 
where r, is small. If we use this and let 
P(r, ,t) = REED EGE Oe (143) 
Vg 
(137) separates into 
OR 6*R 
—— (2gr,R 
Gn oe arg ee 
(144) 
> 1 0® 
et = g _ ad?’ 
The steady state solutions are 
R, = / S exp (—9r1?) 
(145) 


1 


®, SS Sas: 
20 
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so the amplitude has a Gaussian distribution and the phase is completely 
random. The non-steady state Green’s function solution of (144) is 


1 — [ri —n(t)P 
PC 118.0) = Tere (“eG | 
a | roe) ; 2 
«sb exp[in@-do+0,1) ~~ 1}, (146) 
where 


e(t) = - [1— exp (—4g#)] (147) 


n(t) = ro exp (— 2g). 


The fluctuation spectrum is given by 


= . _ 2 + (1/9’) 
i _o (—iwt)<at(t) a(0)> dt = (o,—a)?+ (1/9? * @,~o)* +[(1/g) +287 


(148) 


which is the sum of two Lorentzians. The first has half width g~' and is very 
narrow, and the second has half width 2g and is very broad. 
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The Semiclassical and Quantum Theory of the Laser 


H. HAKEN 


For the purpose of this Summer School we have put the physics rather than 
the mathematics into the foreground. A physicist, who wants to learn about 
laser theory, usually first wishes to know the gross-features, and only after- 
wards does he want to learn about the more refined aspects like fluctuations 
With this in mind, we shall discuss the following: 


1. The structure of laser theory 


2. Rate equations 
Heuristic derivation 
Treatment of typical cases 


3. Semiclassical equations 
Heuristic derivation 
How to come back to rate equations 
Treatment of typical cases 


4, Fully quantum mechanical equations 
Derivation from first principles 
Some different approaches 
Treatment of typical cases 


5. General quantum theory of damping 


Readers who wish to make a quick survey of laser theory may read the 
following sections; 
2.1-2.3; 3.1-3.7; (to equation 140); 4.1-4.3/4.5;  4.12.c; 
4.12.d; 4,13. 
These sections are written such that they can be read without knowledge of 
the omitted ones. 


1. THe STRUCTURE OF LASER THEORY 


Laser light is generated by stimulated emission (as it is expressed by the word 

laser). Both this process and spontaneous emission of light (in conventional 

light sources) are typical quantum mechanical effects. A proper laser theory 

must therefore be a quantum theory. Because light is an electro-magnetic 

field, the fundamental equations must be taken from quantum-electrodynam- 

ics. The emission processes are caused by the electrons of atoms (or solid 
201 
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bodies), thus we have to consider the interaction between the quantized electro- 
magnetic field (in short: field) and a system of electrons, which must be treated 
quantum mechanically, of course, in order for the theory to be consistent. 


Schrédinger equation 
interaction (or second quantization) 

. 4 hae for the electrons 

(matter equations) 


Quantum electrodynamics 


(field equations) 


coupled equations 
without losses: 


proper laser system 


The field equations coupled with the matter equations give the basic set for 
the treatment of all kinds of processes in which light and matter (as far as the 
atomic shell is concerned) are involved. These equations are, however, in- 
sufficient for a laser theory, because they neglect the effect of the surroundings 
—the field can partly escape through the mirrors or may be absorbed by im- 
purities etc. ; the electrons interact with lattice vibrations, the pump-light, etc. 
All these effects stem from external reservoirs (or heatbaths), and are to a 
great extent of a statistical nature. Because laser action can be only maintained 
if the energy losses are balanced by the energy pumped into the system, it is 
quite clear, that any useful laser theory must incorporate these statistical effects. 


heatbaths, (reservoirs), 
(pumplight, cavity 

losses, lattice 

vibrations etc.) 


proper laser system interaction 
field matter Roa ra 


realistic laser equation 
fully quantum mechanical 


A possible way of arranging the present lectures would be to derive the 
realistic laser equations from the very beginning and then to discuss their 
solutions.“ We have, however, chosen a different way because the realistic and 
complete laser equations are, in general, rather complicated. On the other 
hand a number of gross-features of laser light can be understood by much 
simpler equations. This may be justified by asking the question as to what we 
expect from a laser theory. Firstly, we want to know the conditions under 
which laser action occurs, and the intensity of the laser light produced. The 
next question perhaps would concern the actual frequency of the laser light. 
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Finally we observe that the frequency is not completely sharp but has a 
finite width due to quantum fluctuations. A closer inspection of the complete 
laser equations reveals that their full content of information is needed only 
for the last question. To treat the second question it suffices to use equations 
which are averages of the fully quantum mechanical equations. These equa- 
tions are called the semiclassical equations. Finally, the first question concerns 
only intensities, or, photon numbers, so that in general it is sufficient to discard 
phase relations and to use the so called rate equations for intensities. Thus we 
have the following scheme. 


fully quantum mechanical equations > linewidth, 
intensity fluctuations 


average over quantum fluctuations 


semiclassical equations ——-> frequency shifts 
mode pulling and 


pushing, phase-locking, 
ultra-short pulses 
(~ uu SEC) 


elimination or neglect 
of phase relations 


| rate equations ——— > threshold condition 
light intensity 


For a more complete survey consult Table 1 


2. RATE Equations” 


2.1. EQUATION FOR THE PHOTON NUMBER: LASER CONDITION 


We all know from the usual quantum mechanics of particles that there exists a 
dualism between particle aspects (momentum, position) and wave aspects 
(frequency, wavelength). Similarly, such a dualism exists between photons and 
waves in quantum electrodynamics. The use of rate equations rests on the pos- 
sibility of describing (at least under certain conditions discussed below) the 
light field in the photon picture. Following Einstein, we can define generation 
and annihilation rates of photons and thus establish rate equations. Let us 
consider a single mode with a photon number 27, which is treated as a con- 
tinuous variable. Its rate of change dv/dt, is caused by a number of different 
effects. 
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TABLE I 


Quantum mechanical 
equations 


Linewidth intensity fluctuations, 
Hanbury-Brown-Twiss experiment 
coherence, photon statistics 


Avetage over pumping 
and relaxation processes 


Semiclassical equations 


Neglect of all phase 
relations 


Rate equations 


Frequency shifts, time dependent 
population pulsations, modulation 
effects, undamped spiking, photo echo, 
super-radiance, phase locking, 
ultrashort pulses, multiple quantum 
transitions with correct phases, 
harmonic generation, stimulated 
Raman and Brillouin scattering 


Progressing 
simplification 


Threshold condition, output power as a 
function of pump power, coexistence of 
modes in homogeneously and 
inhomogeneously broadened lines, 
laser cascades, giant pulse laser, 
damped spiking 


Emission and absorption of photons 


Each excited atom spontaneously emits photons into the mode under con- 
sideration at a rate W. For simplicity we assume that this rate does neither 
depend on the position nor frequency of the individual atoms. This assump- 
tion is correct if the mode has infinite wavelength or is represented by a 
travelling wave (W : space-independent) and ifthe atomiclineis homogeneously 
broadened (W : no individual frequency occurs). 

If N2, is the occupation number of the atom yp in its excited state, the total 
rate of spontaneously emitted photons is 


N 
y WN2, re (1) 
p=1 


where the sum runs over all N atoms. Phase relations do not appear. The 
photon generation rate by stimulated emission is obtained from (1) by multi- 
plying it with the photon number 7: 


WN”. (2) 
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On the other hand, the rate of stimulated absorption is proportional to the 
same rate W, to n, and to the number )'7_ , N,_,, of atoms in the lower state (1) 
and is given by 


N 
— WN, 7. (3) 
pel 
The net gain of the stimulated process is thus 
N 
Wz —Ni)u n. (4) 
a> 


Losses. The photons may escape from the cavity through the mirrors, by scat- 
tering processes, etc. Their decay rate is proportional to the number of pho- 
tons present and is given by 


—2kn, (5) 


where x is the cavity halfwidth. Putting the net gain (4) and the losses (5) 
together we find the photon rate equation: 


N N 
dni/dt = — 2kn+ ¥ W(N2—-Ny)un + > WN, (6) 

n=l p= 
It always appears strange, that (6) contains no phase relations at all. We 


shall discuss this problem below. Because W does not depend on p, we may 
introduce total occupation numbers 


Nij,0ot = =M >; J=1,2, (7) 
and the total inversion (‘population difference’) 
D = Nayar Naser = 2 (N2- Ne () 
With (7) and (8), the eqn (6) takes the form 


dn/dt = —2xkn+D Wn+Noa iW. (9) 


It is customary to neglect the rate of spontaneously emitted photons because it 
is much smaller than the stimulated emission rate. Actually, there is a deeper 
reason for neglecting WN,,,, compared to DWn and this will be treated in 
Section 4. 
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Equation (9) allows us to derive a laser condition. Laser action takes place 
if the generation rate is bigger than the loss rate, i.e. if 


W .D> 2k, 
or 


W(N2-N)tor > 2K. (10) 


Thus for laser action to take place, the inversion D = (N,—N,),,, must be 
made sufficiently high. The photon life-time, t,, in the cavity is related to 2x by 


2k = 1/t;. (11) 


We now derive an expression for W. Let the radiative lifetime of an atom be 
t, so that it emits 1/t photons per second spontaneously. We are, however, 
interested only in one special “mode’’. We have therefore still to divide by p, the 
number of modes within the spontaneous linewidth Av of the atom. The num- 
ber of photons spontaneously emitted per atom per second into a given mode 
is thus 


W = 1/tp (12) 


For a Lorentzian line, p is given byt 827v?V Av/c? so that one obtains the 
laser condition® 


(N2 —N4)eot e 
= et — > 2k, 13 
SevVAy a?) 
where V is the volume of the cavity and c is the velocity of light. In order to 
satisfy (13) the following conditions must be fulfilled. 


(a) The lifetime must be large enough. This requires a detailed study of the 
losses of decaying modes in the Fabry-Perot resonators as well as in other 
resonator types. Also other loss mechanisms as well should be considered, 
e.g. impurity scattering, or coupling between modes. 


(b) The atomic linewidth Av must be small. This is mainly a question of select- 
ing materials but nevertheless requires a theoretical investigation of broaden- 
ing mechanisms. Note that Av can be much bigger than (1/t), because Av 
includes all kinds of broadening, e.g. lattice-vibrations, etc. 


(c) The inversion (N, — N,) must be high enough. This requires a detailed 
knowledge of excitation mechanisms, e.g. optical pumping, etc. 


+ Here vy is the frequency defined as usual. Note, however, that in the remainder of the 
chapters, v denotes the circular atomic frequency. 
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2.2. THE MATTER EQUATIONS 


Because of spontaneous and stimulated emission, the atomic occupation num- 
bers change. To find an equilibrium the atoms must be continuously pumped 
into their upper states. We first treat two-level atoms, and describe these pro- 
cesses by the following two equations for the single atom p 


AN ,/dt = —No .W23+NiyWi2—(N2—N),Wn, (14) 
AN, ,/dt = Noy Wo1—Wi2 Nyy t+(N2—N,), Wn, (15) 


where the w’s are transition rates. 

The external pump is contained in w,,, and w,, contains both spontaneous 
emission into all modes and radiationless transitions. In practice the pumping 
is achieved via a third level which decays very rapidly into the upper level of 
the laser transition. Due to the fast decay, the level 3 may be assumed to be 
nearly unoccupied. Therefore, for each atom, we have 

3 
fast decay 
pump . 
laser transition 


i 
Fic. 1 


(N,+N2), = 1 (16) 


Summing up the eqns (14), (15) over all atoms and using (8) and (16), we 
find 


dD D o—D 
— = — 2DW 17 
dt T ie aD 
where 
Wi2—-Wa1 I 
Do = —————_N, —= = Wy2+Wa, 18 
0 WiotWoy T 12 21 ( ) 


and N is the total number of atoms. The unsaturated inversion, Do, is obtained 
by the pump and all other incoherent processes but in the absence of laser 
action (i.e. 2 = 0). The actual, or saturated inversion is D. Equations (9) and 
(17) are the complete laser equations. 


2.3. STATIONARY SOLUTION OF EQUATIONS (9) AND (17) 
When the system is stationary, we have 


dn/dt = 0 and dD/dt = 0. (19) 
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From (17) we obtain 
D = Dyo/(1+2T Wn), (20) 


which determines the saturated inversion D by means of the unsaturated 
inversion Dy and the photon number n. Inserting (20) into (9) and solving the 
resulting equation for ” we obtain 


n = (DoW — 2) /4T Wx, (21) 


where we have neglected the spontaneous emission term. This equation allows 
us to determine the actual photon number as a function of the unsaturated 
inversion. When we require that the number » starts to be positive, we reobtain 
the threshold condition (10). There is a strange difficulty with rate equations. 
The photon number ought to be positive even if (10) is not fulfilled. We might 
try to include the spontaneous emission term (1), but then we mix two proces- 
ses which, intuitively, apply to different situations of ‘‘coherence’’. This prob- 
lem has been solved in a satisfactory way only recently; its solution will be 
discussed in Section 4. 


2.4. RATE EQUATIONS FOR THE THREE-LEVEL SYSTEM 

In most cases laser action occurs between two excited levels (see Fig 2.). The 
field equation remains the same (see eqn (9)), but the matter equations 
become 


dN ,,/dt = —W No n+Wo2 Non—WN2-Niaat, (22) 
dN,,,,/dt => —Wio NiytWar No ytWor Not W(N2—-N,)u n, (23) 
dNo,,/dt = —Wo2 Nout Wio Nip (24) 


We shall discuss some important special cases of (22-24). 
Because the electron is in any of the three levels, we have 


NoptNiptNou = l, (25) 
which allows No,, to be eliminated. In many cases the ground state may be 
2 
Wo; laser transition 
pump | 
Wo} Wio 
re) 
Fic. 2 


treated as a reservoir. If the pump rates Wo2, Wo, are much slower than the 
decay rates w,, and w,9, most of the atoms are in their ground states, so that 


No, © 1. (26) 
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Using (26) in eqns (22) and (23) we obtain 


dN, ,/dt = Wo2—-W21 No n— W(N2-N1)p n (27) 
dN, ,/dt = Woi1—-W10 Nyt W(N2—-N,), n, (28) 


In the second equation we have also neglected a term w2,N,_, in order to be in 
accordance with other approaches. Without laser action, N,, and N, , would 
arrive independently at their equilibrium values (Wo2/w2,) and (Wo1/W10); 
respectively. 


The laser equation can be simplified further if the recombination rate w,, is 
high compared to all other rates. Then we have 


N 
Ni,2%0 and Dx YNoy (29) 
n=l 


In this case we may drop eqn (28). Summing eqn (27) over yu we find 


AN tor/dt = NWo2—W21 No 10. — WN 2,101 (30) 


This equation allows us to transcribe all results obtained for the two-level 
system to the three-level system; N2,., replaces D, with the only difference 
that 2W in eqn (17) is replaced by W in eqn (30). 


2.5. TIME-DEPENDENT SOLUTIONS :“ 
RELAXATION OSCILLATIONS WITH SMALL AMPLITUDES‘ 


In contrast to the stationary case, where we can find the exact solution, the 
time dependent case is much more complicated because the eqns (9) and (30) 
are nonlinear, However, for small oscillations around the equilibrium values, 
these equations can be linearized. We treat the upper laser transition, and 
start from the eqns (9) and (30). We put 


D® Nowe = N39 tor tON 2,t0t (31) 
and 


n =No+0n, (32) 


where the stationary solutions ny and N3 ,., are fixed by the conditions 
—2K+WN3 top = 0 (33) 
and 


NwWo2 — NB tot W21—7No N39 too =0 (34) 


210 H. HAKEN 


Inserting (31) and (32) into (9) and (30) using (33) and (34) and linearizing we 
obtain, after a slight rearrangement, the following equations 


d = ON 2 tot 


aS a 2 35 
Pr on No... KNo (35) 

d ON 2 to 
=F ON 2 tot es ne NwWo2 —SNN3 tot W. (36) 


In order to solve this system of linear homogeneous differential equations 
with constant coefficients we put 


on = Aexp (at), (37) 
ON 2 to. = Bexp (at), (38) 
and a has then to obey the equation 
a? + (Wo2NW /2k)a+ (WNwo2—2w21K) = 0. (39) 
According to the two roots of eqn (39), én can be written in the form 


én = A, exp[—(I —ia,)t] + A, exp [—(P'+io,)1], (40) 


where 
T= Wo2W21/2Wie (41) 


and w, and @,,,, respectively, are given by 


2 2 + 
Wo2 W21 Wo2 
=| -—- —_—— — 1)2 42 
es awe, es eva| va) 
Wear = 2w2,K/NW. (43) 


It is evident from (40) that the photon number, after a deviation 6n from its 
equilibrium, returns to its equilibrium value ny. The relaxation oscillations 
occur with the frequency ,. 


2.6. THE GIANT PULSE LASER 


Another example for the application of rate equations is the giant pulse laser. 
It should be noted, however, that phase-locking effects may play a very im- 
portant role in the formation of short pulses, and since phase information is 
not included in the rate equations, they will not predict these effects. 

Let us consider a resonator whose quality factor Q is kept very low at the 
beginning so that no laser action can occur. Then the pumping drives the popu- 
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ulation inversion to a higher value than that which can be maintained during 
the laser process. If now the Q is suddenly increased, the feedback causes 
laser action at a considerably higher output power level than can be achieved 
in steady-state operation. This method, called Q-spoiling,“ can be experi- 
mentally verified by rotating mirrors (or prisma, or shutter discs) so that the 
reflectivity of one mirror changes periodically with time. The switching can 
also be achieved by a Kerr cell,“ which is particularly suitable, if the light of 
the laser-active atoms is already polarized (especially of ruby crystals, for 
example). 

Instead of changing the loss, we can change the gain. For example, in ruby, 
we can cause a quasi-Stark effect, which splits the atomic line, thus giving rise 
to an increased effective linewidth and a lowered gain. When the electric field 
is released, the laser pulse builds up. 

The whole time-evolution of this process can be treated by means of the 
eqns (9) and (30) which we solve semi-quantitatively. In the beginning (t = 0) 
the Q-value is kept low (i.e. 2x is kept high). Under a steady pumping D and 
n assume steady state values D;,n,;. Note that D, is essentially determined by 
spontaneous emission. At t=0 we switch the resonator suddenly to a high Q 
(low 2x,). Because the losses are low, we expect the laser pulse to build up so 
quickly by induced emission that the spontaneous emission rate (as well as 
nonradiative decay rates) can be neglected. Furthermore we neglect the further 
pumping. The laser equations then take the form 


dnjdt = (WD—2k,)n, (44) 
dD/dt = — 2WnD. (45) 


According to (45), the inversion D decreases rather slowly for small values of 
n, so that we may replace D on the right hand sides of (44), (45) by its initial 
value D,. The solutions of (44) and (45) then read 


n = n, exp (at), (46) 
D = D,{1+2Wn[1— exp (at)]/c}, (47) 
“= WD,—2k,. 


In the first phase of the pulse, m increases exponentially with the gain constant 
a and then, because of (45) or (47), the inversion decreases. The increase stops 
certainly at a time t,, when D(t,) ~ 0 (in fact, it stops earlier due to the loss- 
term 2x,). Neglecting 1 compared to exp (at), it follows from (47) that 


n; exp (at,) = «/2W = tax (48) 


or 
t, = (1/a) In («/2Wn,). (49) 
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Equation (48) represents incidentally the number of photons present at this 
time, t,. After the time ¢,, we may assume D ~ 0. The photon number then 
decays approximately exponentially 


n(t) = MNmax ©XP [— 2K, (t— t,)]. (50) 


The equations we have presented are at least good enough for rough esti- 
mates of the quantities ¢,, 4,,, and n(t). For a more detailed analysis, the 
eqns (44) and (45) must be solved in the vicinity of t = t, by a computer (the 
slopes are well represented by (46) and (50)). This has been performed by, 
for example, Wagner and Lengyel,“'™ who have also obtained several exact 
relations. 


2.7. GENERAL FORMULATION 


Above we have treated the example of a single mode interacting with a set of 
two- or three-level atoms. We now want to formulate the rate equations quite 
generally for M modes and a system of L-level atoms. In particular we shall 
take spatial effects into account. The modes are distinguished by an index A, 
where A not only refers to the wavelength, but also to all other spatial proper- 
ties of the modes. These cavity modes interact with N atoms at random lattice 
sites, x,. The optical transition shall occur between the levels 2 and 1. The 
levels 1 and 2 may have any position in the system. The other levels participate 
in the pumping process. Again the rate equations can be divided into two 
groups, («) the field equations, and (f) the matter equations. 


(«) The field equations describe the rate of change of the mean photon number 
n, of the mode A under the influence of losses and gain. The losses can be 
caused by transmission through the end-faces, diffraction, scattering by impur- 
ity centres, crystal inhomogeneities, etc. These effects are taken care of by 
attributing a decay time t, = x,/2 to the mode A. The loss rate is thus given by 


—2k, Nn; (51) 


The gain is due to spontaneous and induced emission. The spontaneous decay 
rate from the level 2 of the atom yp into the mode / is proportional to the occu- 
pation number N, , of the upper laser-active state 2 of the atom yp. We may 
further guess, that it is proportional to the mode intensity at the atomic 
position x, (at least in the dipole approximation). 
If U, (x,) is the amplitude of mode A, then its relation with the optical transi- 
tion rate is 
Wo |U, (x,)I?. (52) 


A typical example is U,(x,) = /(2/V) sin k, x, where V is the volume of the 
cavity. Furthermore if the line connected with the optical transition 2 > 1 has 


—_— 
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a Lorentzian shape of half-width y, we expect that in the case of detuning, we 
have 
2) 
Wo =————, (53) 
y ate (vy = @,) 

where v, is the (circular) centre frequency of atom yu and w, is the frequency of 
the mode J in the unloaded cavity. Finally, we expect that W goes with the 
square of the atomic transition dipole moment 6,,, taken in the direction of the 
mode polarization e,, i.e. 


Wx |e, 02,7. (54) 


These considerations make it clear why the exact derivation of W yields 


2y 
OF 2 =. 2 
Win P+ (Vy, —@,)" IG nal ? (55) 
where 
; 2nw, \* 
Qua = 1021 ( oe) Us (56) 


(or gua = 9V(V) U,(%,), where g does not depend on A, if we restrict our 
analysis to a single polarization). We have supplied W with all necessary 
indices: 0 means optical, A refers to the mode, p to the atom. 

The induced emission and absorption rates can be obtained in complete 
analogy to Section 2.1. The net gain is given by 


nN, WiAN 2— Nadu (57) 
We thus obtain the field equations 
ny, = —2k,njytnm > Wo (N2—Niaut » Wo No we (58) 
u Bu 


(B) The matter equations can be derived quite similarly. For the levels partici- 
pating in laser process we obtain 


Now = » w,2 N12 —-N2, 3 W217 ny ny Wu (N2—N1)y- (59) 
iF 2 i#2 a 
Nip 2 Wh1 Niu—Nia Dwar pe Ny W3AN2 —Ni)y (60) 


For the other levels which are used by the pumping process we get 


Nee = > Win Niu—Ninu 2s Wmie (61) 
l#ém lém 


- = —_——_—S—~S 
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The eqns (59)-(61) are not independent of each other. If we sum up these 
equations over all levels of the atom yp, we obtain 


(d/dt) ( EN) = 0, (62) 


so that 
YN, = constant. (63) 
I 


The fact that the sum over the occupation numbers, N,,, of a single atom is a 
time independent constant is a consequence of the conservation of the elec- 
tron number in a single atom. Hence we have for the single electron problem 
under consideration 

LN ba = 1. (64) 


The transition rate from level / to level m is w,,, and is caused by external 
pumplight, collisions of the second kind, excitation collisions with electrons, 
etc., and all nonradiative and radiative spontaneous transitions. In the follow- 
ing we drop the term )}W2,, N2,, in eqn (58) since this expression describes 


BL 
the generation of “incoherent photons”. (For a deeper motivation and a 
detailed discussion see Section 4.) 


2.8. TREATMENT OF THE STEADY STATE 


The main task consists in the determination of the photon number as a func- 
tion of the pumping, and especially in the derivation of a threshold condition. 
We describe the general procedure. In the steady state we put 


ni, = N,, = 0. (65) 


From (59), (60) and (61) we obtain for the inversion d, = (N,—N,), an 
expression of the form 
A 

4, = ——_—__ ——__ —____.——_.. 66 
* B+ (Wi yp tnW3 pt. tay Wh) Cy 69) 
A,, B,, C, do not depend on the n,’s (but still contain the transition rates, 
Wim). Equation (66) can be derived as follows. Because of (64) we need only 
consider eqns (59), (61). We drop, for example, the equations with index 1 
and introduce the new variables (N,—N,) and (N,+N,) which replace the 
variables N, and N,. The ,’s then appear in only a single equation where they 
act as factors of d, = (N2—N)), in the form 


yy n,W?. w (67) 
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Using the general theorem about the solution of linear equations by means of 
determinants we can write 


d, = A,/A (68) 


where the total determinant A depends on (67) at most in a linear fashion, 
because (67) occurs only in a single element of the entire A. Furthermore, A, 
is obtained from A in such a way that the column containing (67) is cancelled 
and replaced by the inhomogeneity introduced by eqn (64). Consequently, 
A, does not depend on the n,’s. For an explicit evaluation of the A’s B’s and 
C’s it is advisable to treat the 3- and 4-level system explicitly. The photon 
numbers (and thus the emitted coherent power) can be determined as follows. 
We assume that M modes show laser action so thatn, > OforA =1,...,M 
(1’, A” also denote modes) After dividing the eqn (58) by n,, we find 


2K, = 3 Woiu (N2 —N1), (69) 
i 


or, on account of (66), 


W°,,A 

2x, = YS (70) 
BCL Wi, 

There are M equations for the M unknowns, 7,, for which the condition 

n, > Oholds. If it turns out that some n,.. < 0, these n,..’s and the correspond- 

ing eqn (70) must be dropped. In order to obtain an explicit solution of eqn 

(70) it is generally necessary to treat special cases. 


2.9. A THEOREM ON THE MODE SELECTION IN THE COMPLETELY 
HOMOGENEOUS CASE 


We assume that the atomic line is homogeneously broadened so that the transi- 
tion frequencies v,, are equal (= v9). We represent the modes by running waves 
so that they possess a spatially independent intensity. Then the optical transi- 
tion rate does not depend on uy. Finally, it is assumed that the pumping pro- 
cess (and the decay rates) are spatially homogeneous. Under these assumptions, 
A, B, C do not depend on the atomic index p so that (70) simplifies to 


-_—=. (71) 


Because the left-hand side does not depend on the mode-index A, we find that 
W,°/K 3. (72) 


must be independent of A. Consequently, the eqn (71) can be solved in a con- 
sistent way only for those modes for which the condition (72) is fulfilled. 
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From (71) it can be deduced that the mode for which W,° has a maximum, 
starts laser action first. If this maximum is assumed only for one mode, then 
this mode remains the only laser mode for every pump strength. In the com- 
pletely homogeneous case only a single mode (or degenerate modes in the sense 
of (72) with maximum (W°]k)) can oscillate? 

The laser condition is obtained from (69) by putting all n,’s equal zero. 
Using (55) we find 


2ylg|? 


2K = (N2—N,)inresnota -N Psi) 


(73) 


2.10. THE COEXISTENCE OF MODES CAUSED BY SPATIAL 
INHOMOGENEITIES IN A HOMOGENEOUSLY BROADENED LASER 


We consider the case in which the line is homogeneously broadened and the 
modes depend on the space points, for instance, like a standing wave. We shall 
base our treatment of the problem on the following approximation for the 
determination of the photon-numbers. We have seen above the photon numbers, 
n,, can be obtained in principle” by solving the eqn (70). This requires, 
however, that the sums over yp are performed, but these can be done ex- 
plicitly only in very special cases. We therefore expand the right-hand side 
of eqn (70) into a power series with respect to }' n,, W,°, where we usually 
os 


confine ourselves to terms linear in the photon number. As long as the laser 
does not operate too highly above threshold, the photon numbers are still 
small, so that this approximation is well justified and can be substantiated by 
numerical examples. We thus obtain from (58) the following set of linear 
inhomogeneous equations for n, (with the supplementary condition n,>0), 


A,C A 
> ml Wow W%., = = 2K,+ > WE u a (74) 
B P 


2 
uA B, 


From (74) again the threshold condition follows by putting all n,’s equal zero. 
As an explicit example,“!”? we treat modes whose space dependence is 
given by sin kx, so that 


WS, ~ sin? k, X,. (75) 


We shall consider single mode operation first. The expressions for d,, show that 
the inversion is unaltered where the mode has its node. Thus a second mode 
with a different wavelength can still “‘eat” from those portions of the crystal 
in which the inversion is not appreciably diminished by the first mode. We 
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shall confine ourselves to the one-dimensional case, as indicated by (75), or 
to modes with k-vectors which differ only in one component. We then obtain 


. Ws a = W;, 
B 


NY W?, We, =3W,2, for 4=7, (76) 


B 


= W, W,), for AF XN, 


where W,, = 297 y/{(vo—@,) +} N. 
The system of eqn (74) thus takes the form 
B 2K, 
3W,n,+2 Wy my = 22-(1- )N=b, 77 
AM) au MER C dW, A ( ) 


where we have put A/B = dp. Introducing W,n, as new variables, we observe 
that (77) has a completely symmetric form, which immediately allows the 
explicit solution 


| Wht. th) 
Wit =l.- a 
_ al 1 2k, Meld + nl Pal) (78) 
C \2M+4+1. doW, (2M +1) do , 


where M is the number of coexisting modes. 
For the further evaluation of eqn (78) we treat two examples, # and f, 
below. 


(a) Axial modes with a different frequency distance from the line centre 


Let the axial modes be discriminated from the nonaxial modes by a large 
difference between their losses so that we can confine ourselves to the axial 
modes alone. These modes shall have the same losses, x, but they are still 
discriminated by their different frequencies, w,,. Putting @, = Vo+m,6 where 
Vo is the frequency of the atomic line centre, m, is the mode index and 6 is the 
frequency spacing, we obtain the following as the condition for n,>0 (where 
@, = Vo+Md) 


1 K : 1 (2s 2x6? 
ie a 6? —_——_— + ——_, 2)>0, 79 
SMa yg?do % 1! OO+ aM IN dg? Tg am 
or” Ky? Kd? 


_ 


—— - Mo2(2M +1)—2 Ym} > 0. 80) 
797dp ade o( dm} ( 
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Because we have assumed a symmetric position of the mode frequencies with 
respect to the atomic line centre, we have 


Mo = (M—1)/2, (81) 


where my is the index of the mode with maximum separation from line centre. 
Thus we can write eqn. (80) finally in the form 


Ky? KO” 


ee a ee 
197do yo 


The higher the inversion dy and the smaller the frequency difference (com- 
pared to the linewidth y), the bigger is mo, and thus, the number, M, of 
coexisting modes. 


(8) Axial modes with different losses 


We assume that the mode with the lowest x lies exactly at resonance, and that 
the other modes which are lying close to the resonance are non-axial modes. 
For the sake of simplicity we assume that all these modes are situated in the 
same plane in k-space. Because the spatial modulation varies now in a single 
direction (perpendicular to the laser axis), a spatially inhomogeneous inver- 
sion results, so that again a factor 3/2 occurs in eqn (76). (The modes shall 
have radial symmetry). We can neglect the change in frequency but we have to 
take into account the fact that the losses, x,, increase with an increasing angle 
between the propagation direction of the modes and the laser axis. We put, 
therefore, 

Km = K(1+q)"71. (83) 


where q is a small dimensionless quantity. 
Inserting (83) into eqn (78) yields an expression for the number of modes, M, 
which is 


M?< solid (1- om) (84) 


When we express the inversion dy by the threshold inversion d. = 2x/W in the 
form dy = d, (1+) eqn (84) can be further simplified to give 


M? < n/q. (85) 


If, for instance, g = 1077, ie. if the losses differ only by 1 °/, the coexistence of 
modes becomes possible by increasing the inversion by a few percent. The 
coexistence of modes can be weakened by cross-relaxation between atoms at 
different space-points or by diffusion processes so that the spatial dependence 
of the inversion is flattened. 
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3. THE SEMICLASSICAL APPROACH 3) 


Heuristic derivation of the basic equations 


In the semiclassical approach, the electro-magnetic field is treated classically 
by means of Maxwell’s equations, but matter (i.e. the electrons) is treated 
quantum-mechanically. To connect both treatments properly, it is necessary 
to describe the dynamics of the electronic motion by equations referring to 
averages which are quantum-mechanical and statistical in nature. Because this 
approach can be fully understood only in a completely quantum mechanical 
treatment of the laser, we postpone the rigorous derivation to Section 4 and 
confine our present treatment of the semiclassical equations to a heuristic 
derivation. We start with the wave-picture and derive the mode-picture after- 
wards. 


3.1. WAVE-PICTURE*) 
a. The field equations 
From Maxwell’s equation 


curl H = 4nj/c+ (E+ 42P)/c, (86) 

and using the relationsf 
j=oE, H=curlA, E= —A/c (87) 

and the Coulomb gauge 
div A = 0, (88) 


we obtain a differential equation for the vector potential A 


. 4 4 
Bay ray eid ea (89)t 
c c Cc 


and, by differentiation with respect to the time, a differential equation for the 
electric field strength 


1. 4 ‘. An 
Lee ha kes P (90) 
c c c 
b. The material equations 


The polarization P in eqn (90) is not a fixed source, but is caused by the 
electric field, E. We adopt in this chapter the following laser-model. The 
material consists of a solid state matrix, e.g. a crystalline lattice, in which the 


+ note, that o may bea tensor. 
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laser-active atoms labelled with an index p, are embedded at random lattice 
sites x,. We assume that the solid state matrix causes a polarization, which is 
proportional to E. The effect of the inactive solid state matrix may be taken 
care of by using an effective light velocity. In the following we therefore con- 
sider only that part of P which stems from the laser-active atoms. We de- 
compose P into the single atom contributions 


P(x) a > 6(x—x,) Py» (91) 


where 6 is Dirac’s 6-function and the sum runs over the laser active atoms. 
Each p, can be visualized as decomposed into ex where e is the electronic 
charge and x is the displacement of a local oscillator. In a purely classical 
treatment, the equation for a component of p, would read (as in the usual 
dispersion theory), 


2 
oo J e 
Pat 2y p, +,” P, = car E(x,), (92) 


where v, is the frequency of atomic polarization. The damping constant y can 
be identified with the atomic width. This equation is, however, insufficient, 
because it does not account for the effect of saturation. The quantum mechani- 
cal calculation shows that the strength and the sign (i.e. the relative phase) 
between p and the driving field depends on the atomic inversion, which is 
defined as the difference d, = (N2—N,), between the occupation numbers of 
the upper and lower level of the lasing atom, y. Anticipating further a quantum 
mechanical replacement of the factor (e?/m), we may write the quantum 
mechanical analogue of (92) in the form 


O71 
h 


B,+2y Put, Py’ = — 20 E(x,) dy; (93) 
where 0,, is the atomic dipole moment matrix element and wy is the average 
mode frequency. For simplicity we have assumed that the vectors 0,2, p, and 
E are parallel, so that @,, enters eqn (93) in the given way. 

Multiplying eqn (93) by 6(x—x,) and summing over p we obtain (pro- 
vided the atomic line is homogeneously broadened, i.e. v, = v) 


Le 


P(x, t) + 2y P(x, t)+ v? P(x, t) = 2a - 


E(x) d(x), (94) 


where d(x) is defined by 
d(x) = ))6(x—x,) d,, (95) 
BB 
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The inversion (population difference) d, = (N2,—N)), itself may change due 
to radiative emission and due to relaxation processes (which include the pump- 
ing of the atom). Accordingly, the rate of change of the atomic energy hvd, is 
given by 


d d(hvd,) (e 1) 
—— = | ? 
t (hvd, ( di = s dt rae ¢ " 
where 
( d(hvd,) = 2E(x,) ‘ P,, (97) 
dt field 


is the usual classical expression. 
If we confine ourselves to a two level atom we have 


d(fivd,) hv (dy—d),, 
= —___ 98 
(Se dt Vos T p ( ) 
where T is the effective relaxation time to restore the equilibrium value dp. 
Putting (96), (97) and (98) together we find 


—_E(x,). B,- (99) 
y 


Multiplying eqn (99) with 6(x — x,) and summing over p we obtain for the 
inversion density d(x) 


(do — 4)(x) 


7 ee E(x). P(x). (100) 


d 
di d(x) = 


Equations (90), (93) and (99) are the basic equations for a set of laser modes 
interacting with a set of two-level atoms. In the case of multi-level atoms the 
eqns (100) and (93) remain valid, but (99) acquires a more complicated struc- 
ture. The eqns (90), (94) and (100) describe the same problem in terms of a 
continuous polarization and a continuous inversion density. These equations 
may be used as a starting point for the treatment of pulse propagation in 
laser-active media, like 2-pulses, 27-pulses (i.e. self-induced transparency), 
and self-pulsing.’” For a treatment of some of these effects see the chapter 
by Bloembergen in this volume. In the following, we treat explicitly eqns (90), 
(93) and (99) and leave it to the reader to perform the same steps with eqns 
(90), (94) and (100). 
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3.2. MODE PICTURE 


We decompose E(x, f) into the normalized eigenfunctions U,(x) of the 
unloaded interferometer (“‘cavity”’) 


E(x, t) = ps E,(t) U,(x). (101) 
The U’s obey the homogeneous wave equation, so that 
“ost 
AU, = —k,?U, = — a U,. (102) 


Because the U’s are defined within an open cavity, they are not exactly 
orthogonal, although for practical purposes it is sufficient to assume their 
orthogonality. The open cavity causes diffraction losses, which may be taken 
care of by an “effective” conductivity o in eqn (90). Inserting (101) into (90), 
multiplying the resulting equation with U,(x) and integrating over the volume 
of the “‘cavity” we find 


@,°E, + E, + Vi 4noyy Ex = 4nP,, (103) 
V 
where | 
On = | vou, dv (104) 
and - 
B, = [u, Pav. (105) 


We abbreviate 270,, = K,, where x, is the cavity halfwidth of mode A and 
2NO zy = Kay, (A # A’). In the following we put o,,, = Offord 41’. 

T he matter equations can be immediately found from eqns (93) and (99) by 
inserting the expansion (101) for E, which yields 


Kea 


B, + 2y b, + 2p, = — 20 ; dy ¥) Ex(t) Ux), (106) 
and 
d,—d 2 
d, = ——* + — p,  E,()U,(,). (107) 
T hv, Aa 


3.3. THE ROTATING WAVE APPROXIMATION 


This approximation can be applied to both the wave and mode picture. 
We shall treat explicitly the mode picture and decompose E, and p, into a 
positive and negative frequency part to obtain 


E,(t) = exp(—iw,t) BE, () + exp(io,t) EO) = EXOOM+ES, (108) 


p.(t) = exp(—iv,t) BP () + exp (vf) BO = p+ pO, (109) 
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where w, is the frequency in the unloaded cavity. It will become evident below, 
that in a laser we have w ~ v, and that the remaining amplitudes E‘*?, p‘* 


vary much more slowly than exp(iwt). We therefore neglect E , compared to 
w, E,, and pf, compared to vp,. Thus we obtain 


EO +4, EO = —2niwy YU,(x,) BO expliv,—@,)t], (110) 
Bu 
where Wy © @, & v,. Using 


ES = exp(tiow,t) ES; p,{ = p, exp(+iv,), (111) 


we obtain, finally, 


EX = G@,—K,) EX —ioo2n¥, py U,(x,). (112) 
u 
Similarly we find 
Oe os ~) 121? 5 
BuO = G9) DLO 47S VU) EO dy (113) 


where y is the atomic line half width, and 


—d 2i Z - 
,~$e54 2 [(pa0.69) 90 (Fer uo) 


(114) 


where we have neglected terms ~ exp (+ 2iwt) which are small compared to 
unity (rotating wave approximation). The eqns (112), (113) and (114) again 
represent a complete set of laser equations. 


3.4. DIMENSIONLESS QUANTITIES 


In these lectures we shall use a notation which makes contact directly with 
the quantum theoretical treatment by putting 


ES? = i./(2nhm,) b,* (115) 

E,™ = i./(2nhq@,) b,. (116) 

In a fully quantum mechanical treatment, b,*, b, are creation and annihilation 

operators. Here, they are merely time dependent, complex dimensionless 
amplitudes. 


We decompose the atomic polarization p‘*) into a quantity with the dimen- 
sion of a dipole moment (ex, ,) and a remaining dimensionless factor p,,, thus 


Pao? = O21 + Paty (117) 
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and correspondingly 
ps = 0,2. P12,e (118) 
Further, we write 
Now me P22,p (119) 
and Nip = Pi1,n (120) 


In the quantum theoretical treatment, p,,, P12, etc., are the elements of the 
density matrix of the two-level atom. Both in that treatment and this, they are 
time dependent classical functions. Finally, we introduce the coupling 
constant 


Dua = 103 nl ( “nes U;, (x,). (121) 


By use of (115), (116), (117), (118), (119) and (120), the original eqns (112), 
(113) and (114) acquire the form given in the next section which we shall use 
in the subsequent sections. 


3.5. THE SEMICLASSICAL LASER-EQUATIONS“*? 
The complete set of equations for two-level atoms is given by 


b,*(t) = (@y—K,) by*(O) +i, OF P12,n(2); (122) 
u 
Pr2,plt) = (iva1,n—V21) Pr2n—tD Gua b,*(P22— Pir) (123) 


do ,—d ; ; 
d, = + iP 21.0 Ina Ba" —2ip 120 2 Gia bs ne 


where dou = (Wi2—Wa1) | Wi2 + Wai) (125) 


is the unsaturated inversion of a single atom and 
(1/T) = Wy2+Wa4 (126) 


is the relaxation time of the inversion (compare (18)); the inversion of atom 
pis d, = (P22— P11); and x, is the cavity half-width of mode 4. 

If we retain the mode coupling caused by the conductivity o—see (103) and 
(104)—we have to add the terms to the right-hand side of eqn (122) 


py Say (@y/@,) by*. (122a) 
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These terms are of fundamental importance for the treatment of ultrashort 
pulses generated by loss-modulation. In this case, o,,, is assumed to be non- 
vanishing at least for neighbouring axial modes, and to be modulated at a 
frequency close to the axial mode frequency spacing. 

For the derivation of the above equations it was implicitly assumed that 
the upper laser level is pumped directly from the lower laser level. In reality, 
the pumping is achieved via auxiliary levels. We indicate how the equations 
must be altered for a pumping scheme given in Fig. 3. For the sake of com- 
pleteness, we write down all equations. The laser action is assumed to occur 
between levels 2 and 1. The levels 0 and 3 serve as auxiliary levels for the 
pumping process. 


3 
2 
pump laser transition 
} 
0 
Fic. 3 

b,* = (iw,—K,) bX +id ge. Pi2y (127) 

7 
Pi2n = (iv,—7) Paani dD, 9,36 N2,n—-Ni,y) b,* (128) 
Nou = Nia Wio~No,n Wo3 (129) 


Nin = No, Wa1 -Ni, Wio {teat 29 b,* —ipi2,, py Gina b,| (130) 


Noy = N3n W32—-Noa yn Wai t{...} (131) 
Nz, = No,» Wo3 —N 3,4 W32- (132) 


The N, ,’s are the occupation numbers of the ith level of the wth atom and wi, 
is the transition rate from level i to level k caused by incoherent sources 
(pump, spontaneous decay, radiationless transitions). 


3.6. DERIVATION OF RATE EQUATIONS“! ® 

Before we discuss the solution of the semiclassical equation we shall demon- 
strate how to obtain the rate equations which we have used in Section 2. We 
put b,* = B,*e®") where Q, is the frequency of mode J in the loaded cavity 
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and assume that B,* changes only slowly with time (compared to y) and also 
that N, ,—N,,,, changes slowly. The integration of (128) then yields 


= 1G y.a(No2,n—N1,n) 5,* 
a Ne emery 9 eer carat 133 
P12, py (Q,-v,)+9 ( ) 


We insert (133) into eqn (127), multiply this equation by 5,, and add the com- 
plex conjugate expression to the result. This gives us, finally 


: Gr rGuwNau-N1 ) bf. b, 
nh, = —2K,n,+ er crear rms +e0.], 134 
. oe Ly, (Qy—v,)+y a) 


where n, = b,* b, is the photon number. 

We shall neglect all terms with b,.* b,(A #2’). This can be justified as follows. 
As we shall show in Section 4, the phases of the b*’s fluctuate. As long as no 
phase locking occurs the phase fluctuations of the different b’s are uncor- 
related. Thus the mixed terms (A 42’) on the right hand side of eqn (134) 
vanish, if an average over the phases is taken. The rate equations for the 
photon numbers read thus 


A 2y | g ia |? 
= —2K,n,+ >) —— 5 (NaN ip) M2 135 
ny APR U0, -9,)2472 | 2,8 tn) A ( ) 
Equations (129) and (133) have already the required form of equations be- 
tween occupation numbers. 
When we insert (132) into (130) and (131) and apply the same reasoning, 
we find 


271 9,4? 
DIG wil (No ,—-Ni,)2, (136) 


Bae Naat Tae Sey ey 


‘ 2 
No = N34 W32—No2,n Wai -2 ora (Nou—Ni,)%. (37) 
The total set of rate equations is represented by eqns (135), (129), (132), (136) 
and (137). The rate equations allow us to determine photon numbers and 
atomic occupation numbers; they do not contain any information about 
frequency shifts. For this reason the frequencies in the unloaded cavity, w, are 
usually used. 


3.7. METHOD OF SOLUTION FOR THE STATIONARY STATE‘7) 


We now come back to the general equations (127)-(132). Because exact 
solutions of them are known only for special cases, e.g. single mode operation, 
we present an iteration which exhibits some of the important new features of 
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the semiclassical equations. We treat explicitly the two-level system described 
by eqns (122), (123) and (124), although the procedure is quite general. The 
spirit of this approach is the following. In the field equations (122) the 
polarization P acts as a source term, which in turn is determined as a function 
of the field, b,(t), via the matter equations. We determine this dependence of 
P on b, approximately and insert the resulting expression into the field 
equation (122), so that we obtain an equation for the field amplitude alone. 
We assume that by the pumping, as well as by relaxation processes, an 
inversion given by 
d® = dio (138) 


is established without laser action in zero’th approximation. We assume further 
that due to the laser process, coherent fields of the form 


b,* = B,* exp(iQ,¢) (139) 


are built up. The ansatz (139) would suggest itself, if the laser field were a 
classical field. Since it is generated by quantum processes, the ansatz (139) 
requires a deeper motivation, which will be given in the fully quantum 
mechanical treatment in Section 4. 

The complex amplitude B,* and the frequency , are unknown parameters 
which are to be determined in a self-consistent way. The frequency Q, is 
assumed to be real, because we confine ourselves in the present paragraph to 
the stationary state. Under certain circumstances B,* may not be completely 
time independent but may vary very slowly. 

Before we treat the general case of the multimode equations, (122)-(124), 
we give an example for a single mode. We assume resonance, i.e. @ = v (in 
which case one may assume in a self-consistent manner Q = w), and assume 
further, that g, the coupling constant is independent of yp and A, and g is real. 
Inserting (138) and (139) into (123) we find the atomic polarization in first 
order 

p\?, (t) = —idy B* exp (iat) g/y. 


Inserting this result into eqn (124), we may integrate this equation in order to 
find an improved value for the inversion given by 


d, = dy—4dp|B\?(g°T /y). 


We now determine an improved dipole moment, by inserting the new 
expression for d, into eqn (123). The integration of this equation yields 
immediately 


p(t) = —idy B* exp(ior) 7 (1—4|Bl?g?T/y). 
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Thus the dipole moment is known as a function of the field. The field in turn 
is determined by the polarization (or dipole moments). Thus we introduce 
p\2 into the field equation (122) and obtain the following self-consistent field 


equation 
g? 
* = (ia~—K)b*¥+d,N re b*(1—4|b|?g?T /y), 


which is a nonlinear equation for b*. The total number of atoms is N. The 
above steps can be represented by the following scheme. 


field b* = B*exp(iwt) dipole moment 
gen (123) => | (124) = 
inversion d, = dy P12~b* \ 
inversion 
{} | d, = dy—...|b|? 
—ae ee” 
(123) 
new field improved 
equation = (122) | dipole moment { 
P12 ~b* +...|b|7b* < 


The effective field equation is nonlinear in the amplitude. For the whole laser 
theory this nonlinearity is of utmost importance. To see this, we insert 
b* = B* exp(iot) into this field equation and obtain 


2 
pe dyn (1—4|b|2g27/y), 


from which we may determine the photon number n = [6]? as a function of 
the unsaturated inversion dy. If we had neglected the nonlinear term, no 
steady state would arise at all. We now treat the general case of eqns (122) 
and (123), using again the ansatz (138) and (139). 

The atomic dipole moments are changed under the influence of a given 
light field according to eqn (123) whose integration yields“ ® 


exp(iQ,,t) 
(Q, Va iy) ; 


Inserting this result into (124) and integrating this equation we obtain as an 


Piru (t) = —d,,o py By* Gua (140) 
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improved value for the inversion 
exp[i(Q,,—Q,)t] =e 
(Qy —v,—iy){1/T+i(Qy,—Q,)} 
(141) 
Using the improved inversion we can obtain further improvement in the 


atomic dipole moment. For this purpose we insert (141) into (123) and obtain 
after the integration 


d, = dot {2id, » B, BY Gna Gyr 


exp(iQ, t) 
(Q,—v,—iy) 


: exp[i(Q,, + Q,,—Q,)t] 
i{(1/T) + i(Qy — Q,)}(Qy —Q,4+0,,.—v,—iy) 


1 1 
x { ————_. —- —__-————_ ]. 142 
(as aces) (142) 


Equation (142) expresses the atomic dipole moments and thus the polarization 
as an explicit function of the field amplitude b,. 

We now take into account the reaction of the dipole moments on the light 
field. Therefore we insert the p{3),,’s, which have been obtained above, in the 
field equations instead of p,2,,. This yields 


PD ee a 0), Ba" Dua +2d,,0 2, Bi BY By Opa Dux Gua 


B,* exp(iQ, t) 
b,* = (iw, —K) b,*-i¥ d 22a A 
a (iw, —K) b, id 0,019 ual (Q,—v,-i) 
+2 > do,, By, BY, By, Gna Gas Gurr Gas 


#414243 


x exp[i(Q,, + Q,, = Q,,) t] 
[d/T) + i(Q,, ~ 2, JI£Q,, 5 Q,, a QD, aa Vu cred iy] 


1 1 ) 
So (143) 
— Q1,-Vatiy 


It can be shown that the above iteration procedure is justified if the atomic 
linewidth y is larger than the other frequencies inherent in the problem." 
Equation (143) represents the basis of the following two chapters. In fact, the 
whole further analysis consists in performing the summations and integrations 
over energies and atomic indices, and in solving eqn (143). This equation 
describes the oscillation of the modes taking into account their interaction 
through the response of the atomic system. 
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According to different combinations 2,,+Q,,—Q,, on the right hand side 
of (143), new frequencies may occur. Thus, by the cooperation of two or 
three fields an additional field mode may be created. This kind of frequency 
mixing is related to those in inactive materials. To exhibit some further 
features of eqns (143) we specialize them for single and two-mode operation. 


a. Single mode operation 


In this case, we have 
I9ul7do B*Blgy|*do,,4¢yT 
K+i(Q—@) = —i)) —*—*-4+- } ———_+—* ———._ (144) 
Gro Lo-) Ta, -we al 
We split eqn (144) into its real and imaginary parts. 
The real part is 
2x = G,,— Sn, (145) 
where Bye y dould ul?2y (146) 
"4 (Q—v,)? +y” 
and d ig \*8y?T 
= cee (147) 


°F 


After multiplication with the number of photons n on both sides of (145), the 
left-hand side represents the loss rate of photons, whereas the first sum on the 
right-hand side gives the increase of photon number due to induced emission 
from atoms with a given inversion. 

If y is small compared to the spread of atomic energies, (y/{(Q—v,)? +y7}) 
can be approximated by the 6-function, 6(Q — v,), representing energy 
conservation. Only those atoms whose transition energy coincides with the 
mode frequency within a width y participate in the process of stimulated 
emission. The second sum Sn takes into account that the inversion is de- 
creased by the laser process. This effect is called “hole burning into the gain 
curve”’. 

The imaginary part of (144) is 


(Q—@) = (AQ), +n(AQ),, (148) 
where 2 
=a! Gul (Q—v,) don 
and 4 Be 
(AQ), = 4yT>) Nl don Q= V0) _ (150) 


zp L(Q— 7 a v7 
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and the subscripts uv and s denote the unsaturated and saturated conditions 
respectively. 

According to the imaginary part, the active laser atoms change the fre- 
quency of the light mode in the unloaded cavity. The first sum on the right 
hand side has the same form as the usual dispersion formula with a given 
occupation of the atomic system. This expression could have also been ob- 
tained by second order perturbation theory. The only difference is that we 
have already used a renormalized frequency Q instead of w as occurring in 
the usual perturbation procedure. What is essentially new, however, is 
represented by the occurrence of the second sum. In it, the square of the light 
amplitude, or correspondingly, the number of photons occurs. This term 
therefore describes the change of the dispersion due to the adjustment of the 
atomic occupation to the intensity of the light field. It should be noted that 
this response of the atomic system is time independent. In the steady state 
case, the photon number adjusts in such a way that gain and loss compensate 
each other as described by eqn (145). With the photon number fixed in this 
way we can calculate the dispersion part of the right hand side of eqn (148), 
which leads immediately to an expression for the frequency shift in the 
inverted system as compared to the unloaded cavity. 


b. Two mode operation, general features 
Specialization of (143) yields 


dy lg al? do B,* Byl\g Allg xl? 
K+i(Q,—@,) = ¥ 2 -47T YS — aa 
(Q,-o d y+i(Q,—y,) p> [y+i(Q,—v,) Ly? + (Qy—y,)7] 


42 x don B,* Bil ual IG ual? 
2 [Q/T)+i(Q,—Q,)][Q,—v,—iy] 
(A’'#A) 
1 1 
x | ————— —- —— 151 


for A = 1 and 2. Whereas the left-hand side and the first sum on the right-hand 
side are the same as for single mode operation, new features occur due to the 
second and third sum. These arise from a time-independent and time depen- 
dent atomic response. 

The second sum gives rise to a change both of gain and dispersion of the 
first mode caused by the photon numbers of the first and the second mode. 
Whereas the change of the gain is proportional to 


nyy 


ae 152 
[y+ (Qy—v7I ee 
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the change of the dispersion is proportional to 


n,(Q, _ vy) 
[y? B (Q, a 7 | ; 


If we take also into account the fact that the main contributions of the sums 
over yu stem from v, ~Q,, we see that the mutual influence of both modes has 
a longer range for the dispersion than for the gain. In fact, if |Q,,—Q,| is 
bigger than y there is practically no influence of one photon number, n,, on 
the other, n,. On the other hand, the “‘hole burning” changes the dispersion. 
Whereas the hole burning stems from the time independent atomic response, 
the time dependent atomic response gives rise to the third sum in (151). This 
can be seen by following up the iteration steps performed above. Under 
certain circumstances it leads also to an appreciable frequency pushing of the 
two modes. 

In the following we give only few examples how eqn (143) may be solved. 
For a systematic treatment see the article of the present author in Encyclo- 
pedia of Physics.“ 


(153) 


3.8. THE SOLID STATE LASER WITH A HOMOGENEOUSLY BROADENED LINE 
Single and multimode laser action©®) 


The eqns (122), (123) and (124) or the eqn (143) allow us to treat two prob- 
lems: (1) the determination of the mean number of photons in the laser 
modes; (2) the determination of the actual laser frequencies. Although in 
general the solid state laser will have an inhomogeneously broadened line, we 
treat first the simpler case of a homogeneously broadened line which allows 
us, incidentally, to elucidate the general procedure for the solution of 
eqn (143). 


a. Single mode operation 


We start with the eqn (144) where we have to perform the sums (or integrals) 
over pt. Because in the present case the atomic transition frequency v does not 
depend on the atomic index yp, no integration over such frequencies is required. 
On the other hand we still have to sum up over the random atomic positions 
X,- This sum can be performed to a very good approximation by an integration 
over x, 

ye. = p| av, (154) 
7] 
where p is the density of the laser active atoms. In the following we assume 
that the unsaturated inversion, do, = dy does not depend on the position x,, 
i.e. we assume a spatially homogeneous pumping (which is also time independ- 
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ent, of course). In order to perform the integration (154), we adopt, for the 
coupling constant g, , between the light mode A and the atom yp the form 


ua =i ( A )ei021 sin(k, X,)s (155) 
corresponding to purely axial modes where @,, is the atomic dipole moment, 
and V is the volume of the active material. The integration for modes with an 
arbitrary k-direction can be done in an analogous way. 

We multiply (144) on both sides with y+i(Q—v). The real part of the new 
equation then reads: 


2ay Baal plo 12nv?|6,2|?7Tii 
~(O-o\(=7) = ee 156 
BE NN) ho ho{y? +(Q—v)*} | (156) 
and the imaginary part is given by 
gel (157) 
yt 


Equation (156) is identical with a rate equation, which determines the 
photon density 7 as a function of the pumping (via dy). The photon density, 
i, is defined as B* B/V. Solving (156) for ni yields 


— _ Gin, (A) p ( Aine ”) 
SS eee ti 158 
6TK dp : ( ) 
2hyKko A? 
ire dud) = sept P (It GeRF } i 


and A=w-— v. From (159) the laser condition (13) follows. 

Laser action starts if 7 becomes positive, or according to (159), if the atomic 
inversion dy, which is fixed by the pumping and the incoherent decay pro- 
cesses, becomes bigger than d,,,, that is, if we have 


do 2 Fines (160) 
or, explicitly? 
= 2 
Do _ N(N2—N4)o > oe :( A :). (161) 
V V 4nv*|6,>| (y+k) 


In this equation, N is the total number of atoms; (N,—N,)o is the atomic 
inversion of a single atom (without laser action); and Dg is the total (unsatur- 
ated) inversion. 
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The laser frequency Q given by eqn (157) is shifted from the frequency Q in 
the unloaded cavity to the centre of the line?” (frequency pulling). For a 
homogeneously broadened line, this shift does not depend on pump power. 


b. Multiple mode operation 


The general solution of the eqn (143) for the multiple mode case is difficult 
and has not been found yet, (at least to our knowledget). Therefore, we dis- 
cuss only the most important feature of this case. As has been explained in 
3.7, the light modes cause both a time dependent and a time independent 
response of the atomic system. Provided the spacing between the frequencies 
of the different modes is big enough, 


IQ,, _ 2,1 > 1/T, Ay # A» (162) 


the time dependent response can be neglected. We neglect further, the possi- 
bility of frequency mixing, an effect which is, anyhow, very small. With these 
simplifications the real and imaginary parts of eqn (143) lead to two sets of 
equations, which determine the photon density of the modes and the 
frequency shift. 


a. Equations for the photon densities of M modes from the real part 


oe ze _ Ky p Aine(A1) 
PP eee eee aatay ttn = eB (1- 
 din( Ay) 7 Aine(Aa) An(An) 4T do 
(163) 
= Ky a K2 = Kin p di,.(Az) 
ny, ——— +38 qty te tine = £ (1- Se 
* An(Ar) 7 7 diy(Az) tnr(Am)  4T dy 


(m equations) 


where w is to be supplemented with the index A and d,,,.(A,) is defined in 
(159). These equations are identical with the rate equations (see Section 2) and 
have been discussed in 2.7. 


B. Equations for the frequency shift 
From the imaginary part of eqn (143), using (163) we get 


_ YO, + KY 


Q ; 
: yt, 


(164) 


i.e. all modes show the power independent pulling, known from single mode 
action. 


+ A case of particular interest, which has been treated, is that of mode locking in a multi- 
mode laser. 
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3.9. THE SOLID STATE LASER WITH AN INHOMOGENEOUSLY BROADENED GAUSSIAN 
(22) 
LINE 


The essential difference between the preceding Section 3.8 and the present one 
is that we must not only sum up over the atomic positions x, but also over the 
transition energy (or transition frequency v,). 


a. One mode 


As an example we choose a Gaussian distribution for vy. It should be noted 
that other distributions can be treated similarly, or in the case of a Lorentzian 
line shape, even more simply. The sums over the atomic indices then lead to 
the following expressions 


_ pAdy ([ y-i(Q—v) _ y-i(Q-yv) 
i aa || Resp eee Pray | 
exp[ — {(v—V9)/a}7]dv 
sae a ee (165) 
where 


A = 4rv97|0,2|7/ho, 


a is the inhomogeneous atomic half-width, d) is the unsaturated atomic 
inversion, and vp is the frequency at the line centre. In eqn (165) we have 
already integrated over x, and it was assumed that the vector of polarization e 
is parallel to the atomic dipole moment 6,,. The integral over v is evaluated 
under the assumption that the inhomogeneous line width is large compared to 
the homogeneous Lorentzian line width, which is described by the imaginary 
part, y, of v. Using the result of the evaluation of the integral and splitting 
eqn (165) into real and imaginary parts, we find the following equations for 
the frequency shift and for the photon density. 
Equation for the frequency shift 


Q-w = — pAds exp(—6) 0(6)+AC, (166) 
4 
where 3 
(65) -| exp(+u?) du (167) 
0 
C = {3A*pdoT n* 5 exp (—67)}/4a? (168) 
and 


5 = (Q—vo)/a. (169) 
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The total number of atoms is N, and « is the Doppler width (compare eqn 
(165)). The first term on the right hand side of (166) is peculiar to an in- 
homogeneous Gaussian line broadening.'**) Note that this term is pump 
dependent on account of its factor do. It gives rise to a pulling of the mode 
frequency w in the unloaded cavity to the centre of the emission line, vg. The 
next term depends on the photon density (or, in other words, on the field 
intensity in the cavity) and describes a frequency pushing. In order to deter- 
mine its size, we insert the value of #, which is anticipated from eqn (173) 
below and we obtain 


Q-o = ree | o()-52 at(1- Au), (170) 


where d,,,-(5) denotes the threshold inversion, which is given by 


hax exp 6” 


dine) =F EF piv al2 


(171) 


From (170) we see that the repulsive term is small compared to the first on 
account of its factor y/«. 

Equation for the photon density. 

This follows from the real part of (165) and reads 


K (172) 


+ + —§2 
= pale E exp(—62)—3ATA——_~_—— il 


2ya 


Solving this equation for # we arrive at 


Seat Sa dy(6)\  » ™* pdy,.(0) nC) 
A= se (1) = EA (1 ).y 


This equation allows us to determine the photon-density (or light intensity) as 
a function of the pump rate, which occurs in dy and T (compare eqn (125) and 
(126)). Equation (172) can also be derived in the rate equation approximation 
(compare 2.7). 


b. Two modes 


A detailed discussion shows that the photon numbers of the two modes 
influence one another significantly only if the frequency separation AQ = 
Q,—Q, is of the order of the natural linewidth, y. For (y/AQ)? <1, we can 
therefore use the expression (173) for i,. The coupling between the two 
photon numbers decreases roughly as (y/AQ)*. On the other hand the mutual 
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influence of the frequencies is larger and decreases only as (y/AQ) if |AQ|>y 
in the same order of approximation. 
Equations for the frequency shifts 


— pAd x e 
Q,-@, = —_ exp(—6,7) ® (5,) +f, Cy +z My. (174) 


The first two sum terms are identical with those of the expression (166) for the 
frequency shift in single mode operation (we have merely to replace in (166) 
and in (169) 6 by 6, and to put C, = C). The last sum term shows that the 
frequency shift of the first mode depends also on the intensity of the second 
mode. We quote the result for a large frequency separation |AQ]|>y. In order 
to put the relative size of the various contributions in evidence, we rewrite 
(174) in the form (with (AQ)? > y’), . 


d, be pase 
Oy — oy = «FS {ait Ai a1’ +2 Ga} (175) 
thr 1 
2 | _ 3ATS, 
where n= ar PO), 91 = me (176) 
2AT 
Go AQ exp(6,?—6,’). (177) 


The first two terms (176) are exactly the same as those of one mode, which, as 
we have discussed above, give rise to a pulling of the line toward the centre of 
the emission line. The following term (177) is brought about by a time 
independent depletion of the excited atomic states due to their interaction 
with the other mode. It is thus a hole burning effect as has been described in a 
macroscopic language by Bennett.'?) There is actually a further term in 
addition to g, which stems from the time dependent response of the atomic 
system. It is, however, in general small compared to g, in an inhomogeneously 
broadened line. 


3.10. EQUATIONS OF MOTION FOR THE GAS LASER? 

a. Haken—Sauermann treatment 

The essential difference compared to the solid state laser is given here by the 
motion of the gas atoms. Consequently, the co-ordinate of a single atom is now 
given by x,+v,¢. We allow further for arbitrary angles, 0,,, between the 
vector of polarization of the light mode A and the dipole moment of the pth 
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atom. The interaction constant g,, between the mode 4 and the atom yp takes 
thus the form 


i : 
Gua = ~~ {exp[ik, (x,+v, t)]—c.c.} cos 0,;° (178) 


The summation over y runs in the corresponding equations over the positions 
X,, the velocities v, (for which a Boltzmann distribution is assumed) and over 
all angles 6,. The equations of motion can now be taken directly from those 
referring to fixed atoms, (127)-(132) or (122)-(124), if the replacement 
(178) is made. The frequency v, is simply to be identified with the centre- 
frequency vy (the Doppler broadening is automatically taken care of by the 
explicit consideration of the atomic motion). 


b. Lamb treatment 


We represent still another way of introducing the atomic motion due to 
Lamb, which is different in form from, but completely equivalent to the pro- 
cedure (178) of Haken and Sauermann outlined above. Consider an atom 
excited by some process at time f, to its upper level. Let the atom be at the 
position X, at fp and have the velocity v so that at time ¢ > ft, the atom will 
be at 


XK = Xg+v(t—fp). (179) 


The corresponding density matrix p(Xo, fo, V; t) obeys the equation 


; i 
p= —-— [H, p]-i(0.p+p.T), (180) 
where an ('. ° 
0 ¥, 


and H is the Hamiltonian of a single atom, ,, y, are the decay constants of the 
corresponding levels. The field in (180) has the argument as defined in (179), 
H = H(X9+V(t~—?o), ¢. Furthermore, p(Xo, fo, Vv; 1) has to obey the initial 
condition 


1 0 


Because the light mode interacts with a macroscopic polarization P(x, t), we 
combine the contribution of all atoms which arrive at x at time ¢, no matter 
when or where they were excited. We note that 


P(x, t) = (0,. Pi2(X, t) +02; foi (x, t)), (182) 
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where 0;; is the dipole moment matrix element of a single atom, and p,; is the 
density matrix element for the corresponding transition (i.e. the off-diagonal 
elements of the density matrix of the ensemble of atoms). Instead of deriving 
an expression for P(x, t) we therefore derive p(x, t). We assume that the ex- 
citation processes obey a Poisson distribution. Then p(x, t) is obtained by 
integration over the time (t — fp) since the last excitation collision using the 
probability distribution (1/T,) exp [— (¢ — to)/T,J, where T, is the relaxation 
time for atomic inversion. We then find 


i t 
p(x, t) = T, [. Pe [—(t—to)/T,] dto ave[a vA(Xo, tos V) 
X P(X» fo, V, 1)5 (K—Xo—V(t—to)), (183) 


where Vo, v are integration variables (space or velocity), and A(xo, fo, Y) is a 
mean excitation rate, whose significance becomes especially clear in eqn (188). 
Since p(Xo, fo, V, £) decays exponentially because of the I in eqn (180), the 
exponential function, exp[ — (t—¢9)/T,,] can be neglected if (1/T,)<yz, 75. 

c. Equivalence between the formulation of Haken-Sauermann and Lamb 


In order to demonstrate the equivalence of the formulation (180)-(183) with 
the formulation by means of eqns (127)-(132) where the matrix element (178) 
is used, we write p(x, t) in the form 


ax, ) = [ava%y,0), (184) 
with 


post) = 7 [) expl—(-10/T,] ato 
pv —o@ 


x i AVoA(Xo, toy ¥) P(Xos to V, 1)(X—Xq—W(t—to)) (185) 


and investigate the time variation of p(x, v, 1): 


dp(x,v,t) — A 1 _ : 
at a 7, t, vy) T, p-y grad, p(x, V, t) 
t yi 
+ i dtp exp[—(t—fo)/T,] Tr (x —v(t—fo), to, vy) 
re : 


x {—i/h[H(x, t), p(x—v(t— to), fo, v, t)]—-4(T p+ ply} (186) 


where A = 1( 4 Hy) 


Evidently, H no longer contains fy so that it can be put in front of the integral. 
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Thus we find, finally 


pe AGRI +v grad, p(x, v, 7) 
dt 

_ Ax, 4V)—p ¥, 1) 
= T, 


i ie 
q [H(x,t), p(x, V, t)] 
Pp 
—4(Tp(x, V, t)+ p(x, Vv, t)T). (187) 
In the theory of fluids, (187) would correspond to the local description. By 
making the substitution x = x’+vt’ (and afterwards dropping the prime) we 


end up with the substantial description: 


dp(xtvt,v,t)  A(x+ve,t, v)—A(x+v4, v, 1) 
dt 7 T 


P 


—~ (i/h)[H(x + vt, 1), p(x+vi, v, t)]—-4 {Tp(x+ ve, v, 1) + pT}. (188) 


When we split the matrix p into its elements, we obtain from (188) the eqns 
(128), (130), (131) (in a specialized form, where Np acts as reservoir (compare 
2.4) and N; is eliminated using the fast decay rate of level 3). Most importantly, 
Pu(x +t, v, t) is replaced by p;, ,(t). The equations of the atomic motion (188) 
are thus equivalent. 

The equivalence of the corresponding field equations can be shown even 
more simply. . 


3.11. SINGLE MODE OPERATION IN GAS LASERS‘?>) 


The method of solution consists again in the iteration procedure described in 
3.7. We have to repeat it, however, because the coupling coefficients depend 
also on time. Making the hypothesis 


b*(t) = B* exp(iQs), (189) 
we obtain after the above iteration steps in the same approximation 
[x + i(Q—a@)]B* exp(iNt) = i> g,*(1)p12,,(2)- (190) 
iu 


In this equation, g,*(t) is the coupling constant defined by (178) and p,2 ,(¢), 
in the second approximation, is given by 


P12,y(t) =B* exp(iQr){c, exp[i(kx, +vt)]+e_ exp[—i(kx,+vt)]} 
+nB* exp(iOr){d, exp[i(kx, + vt)]+d_ exp[—i(kx, + vt)]} 
+f exp[3i(kx, + vt)]+f_ expl —3i(kx, +vt)] 
+j,exp [i(kx, +vt)]+j_ exp[—i(kx, + vt)], (191) 
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where ci, d+, 4, /4 are complex constants which are independent of space 
and time, but depend on the mode frequency and on the velocity of the single 
atom. When we multiply (191) with g,* (see (178)) and sum over the co- 
ordinates, the time dependence with exp(ivt) and exp(3ivt) drops out on 
account of the orthogonality properties so that the right hand side has exactly 
the same time dependence as required for b* by the ansatz (189). 

Integrating over x and averaging over @ explicitly and assuming a Gaussian 
velocity distribution w(v) as well as a homogeneous spatial distribution of the 
active atoms, we obtain the following two equations (« and f) for the photon 
density and frequency up to order y/«. 


a. Equation for the photon density 
n* pAdy exp ~6? 3 1 y 
: 6a | a (5 aR | oe) 


where A is defined in (165) and d, is the unsaturated inversion of a single 
atom. From this we can obtain the photon density 


107, 3duu(6) yoyo 
t= sar(\-“F ae oa 


The photon density shows a dip (known as Lamb dip) if plotted as a function 
of frequency. It is interesting to note that this dip is brought about by the fact 
that the atoms move in both directions, so giving rise to two holes being 
burned into the inhomogeneously broadened line at two symmetric points of 
the line shape. If we have fixed atoms instead no such dip occurs. 


(B) Equation for the frequency shift 


Q-o = — se exp(—67) (5)+7C’, (194) 


where A, 6, and ©(6) are defined in eqns (165), (169) and (167), respectively, 
and C’ is given by 


_ A*pdoT x* 


1 Q—Vo 1 rs) 
Cc’ — 
20 


—_ 2 rn — 
xp 8 y4+(Q—y) 5 a at?) 


We compare this result with the expression (166) for fixed atoms andasingle — 
direction of polarization. The first sum term on the right-hand side in (194) _ 
agrees with the corresponding one in (166) apart from a factor 1/3 which 
stems from the integration over the polarization (only 1/3 of the atoms 
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participate in the laser process on the average). This term represents power 
dependent mode pulling and stems from the Doppler shape of the line. 

With respect to the term proportional to n which comes from the time 
independent atomic inversion, we observe that the part of C’ which stems 
from the second sum term in the curly bracket agrees with the total expression 
C (eqn (166)) apart from a numerical factor which stems from the integration 
over the polarization angles again. This second term in C’ is generally much 
smaller than the first sum term (provided the detuning is not too strong 
(5<1)). It describes the frequency pushing which is due to the existence of two 
holes burned into the inversion (note, that a standing wave interacts with 
atoms whose frequencies are shifted both by +kv and —kv, where k is the 
wave number of the mode and v is the velocity of the atom). The frequency 
pushing becomes dominant if the mode is tuned to the centre of the line within 
about a natural line width. 

Inserting fi given by eqn (193) into (194) gives, finally 


A ‘ dy 
Q= aK) exp ) du-+( 5555-1) 


y(Q— vo) RE) 


2 fe i 
x (-= | exe aa ee (oT ’ 


4. THe FULLY QUANTUM MECHANICAL TREATMENT 


4.1. NEED FOR A FULLY QUANTUM MECHANICAL TREATMENT 


In the foregoing Section 3 we have seen that the semiclassical equations allow 
us to treat a great number of laser phenomena. These equations have a 
fundamental drawback, however, they predict that laser action should occur 
only above a certain threshold, and that the light has an infinitely sharp line 
(the phase and amplitude are fixed quantities), whereas below that threshold 
no laser action should occur at all. It is, however, evident, that even below 
threshold the laser emits light (any ordinary lamp does so) and it is further 
known that laser light has a finite linewidth above threshold. 

Let us discuss the reason for these effects by first considering a region far 
below threshold, where the laser acts as a usual lamp. Its light stems from 
spontaneous emission, which manifestly is a quantum mechanical process. 
When the inversion is increased, the spontaneously emitted light paths are 
enhanced by stimulated emission. This effect has been treated by a great 
number of authors, using a semi-classical or a quantum mechanical linear 
response theory. They predicted a finite linewidth which decreases with in- 
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creasing inversion. The field should be a superposition of statistically inde- 
pendent wave tracks so that its amplitude possesses a Gaussian distribution. 
For quite a while it was rather generally believed that these theories repre- 
sented the whole story. There was still a puzzle, however—why should one be 
allowed to use /inear theories to explain the linewidth, while for all other laser 
phenomena the nonlinearity (treated in the theories of Haken, Sauermann and 
Lamb) played the decisive role? Thus a theory was needed which was both 
quantum mechanical and nonlinear. In 1964 we developed such a theory.? 
It revealed immediately that the statistical properties of laser light differ 
basically below and above the laser threshold. Well below threshold the 
results of the linear treatments were re-obtained. Above threshold, the field 
amplitude was stable with small quantum mechanical amplitude and phase 
fluctuations superimposed on it. Speaking more mathematically, it turned 
out that the creation operator b,' of the mode A could be split into a c-number 
and a residual operator. The c-number part is exactly the one, which is used 
in the semiclassical equations. The predictions about the amplitude fluctua- 
tions were fully substantiated by experimental results published shortly 
afterwards.?” 

Our equations can be considered as quantum mechanical analogues of the 
Langevin equations of classical physics. Besides this kind of laser equation, 
which have been subsequently used by a number of authors (Sauermann,?® 
Lax,?» Haken,? Arzt et al.°) two further methods were developed, which 
proved to be more or less equivalent to that approach: the laser master- 
equation (derived by Weidlich and Haake in 1965°) (see also Pike and 
his chapter in this volume; Scully and Lamb,°* Fleck,@” Willis,@> 
Weidlich et al.,°© Gnutzmann and Weidlich®”) and the Fokker—Planck 
treatment of Risken.©® Risken interpreted our quantum mechanical Langevin 
equation as a classical one and established its associated classical Fokker— 
Planck equation. The exact stationary solution allowed him to predict the 
smooth change of photon statistics from below to above laser threshold. 
His predictions, as well as further ones on correlation functions, were fully 
substantiated by experiments.%% There remained, however, a challenge for 
the theoreticians working in that field: why is one allowed to use a classical 
Fokker—Planck equation for a definitely quantum mechanical process? Or in 
other words: how can one derive a Fokker—Planck equation from a quantum 
mechanical equation without losing the quantum mechanical information? 
A first important step in that direction was taken in the paper of Weidlich and 
Haake. When they chose the density matrix in the coherent state representa- 
tion (see Glauber’s chapter in this volume), they obtained a partial differential 
equation of the type of a classical Fokker—Planck equation. A further step was 
taken by Lax and Louisell,“@° who derived a Fokker—Planck equation for the 
field using a representation equivalent to the diagonal coherent state representa- 
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TABLE II 


Results which can be checked by experiments: 


correlation functions for phase and Photon distribution 


amplitude (intensity) fluctuations, higher order correlations 


- * exact or 
TtT approx, direct approx, numerical 
~~ solution solution solution 


Fokker—Planck 
equation field 


Langevin equation 
field 


density matrix equation 


field 


photon 
number 
represent. 


coherent 
state 
represent. 


Fokker—Planck equation 
with q.m, drift and 
diffusion coefficients, 
field and atoms 


neglect of higher 
order derivatives 


“Quantum mechan. . . : ‘ i ’ 1. Fokker— ] 
j i Density matrix equation generalized general. Fokker. 
| Langevin equations tela + ete distribution Planck equation | 
| field + atoms inetiod field + atoms 
Heisenberg : PAP | 
| . Schrédinger 
| a picture | 
| Total Hamiltonian This box | 
| field, atoms, heatbaths contains the 
L exact equations | 


+ A: adiabatic elimination of atomic variables 
tft E: exact elimination of atomic variables 
ttt L+Q: linearization and quantum mechanical quasilinearization 


Family tree of the quantum theory of the laser 


Approximation methods 
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tion (see Glauber’s chapter in this volume). Their treatment of the field is now 
generally adopted (although there may be some pathological cases where it 
fails, as shown e.g. by Mollow and Glauber). The laser equations contain, of 
course, also the atomic variables or, as we will see, the atomic operators. 
Thus the task remained to derive a Fokker—Planck type equation for those 
atomic operators. Schmid and Risken‘*’) proposed a distribution function for 
a single atom which contained all the relevant atomic operators in a sym- 
metrized form, and derived the first two moments for the Fokker—Planck 
equation. Lax”? first confined himself to the occupation numbers. Because 
these quantities commute with each other and with the field operators, this 
treatment is trivial, however, as stated by Lax and Yuen.) Finally, a com- 
pletely exact generalized Fokker—Planck equation was derived for all non- 
commuting macroscopic variables of a ground state laser (Haken, et al.4*), 
If one keeps the leading terms, this equation reduces to one essentially equiva- 
lent to that proposed earlier by Risken et al.“¢°) The method of Haken et al. 
was adopted by Lax and Yuen“* to treat an upper state maser, while 
Gordon® used a distribution function for the atoms of the Schmid—Risken 
type. Recently, the explicit form of the generalized Fokker—Planck equation 
for an arbitrary dissipative (or non-dissipative) quantum system, provided its 
density matrix is given, has been derived,“” so that this problem can be 
considered as completely solved. This equation is derived in Section 5 of this 
chapter. 

The logical connection between the different approaches is shown in Table 
II. In view of the numerous approaches which often differ but slightly, one 
cannot help having the impression that the problem of the quantum theory of 
the laser has been “‘overkilled” (as may be the case with many contemporary 
problems of physics). On the other hand these methods, which allow us to 
deal with systems far from thermal equilibrium, may have important applica- 
tions in other fields of physics, e.g. nonlinear quantum optics (parametric 
optical oscillator, second harmonic generation), critical phenomena of light 
propagation in continuous media, super-conductivity (e.g. Josephson 
junctions), nonlinear spinwave theory, tunnel diodes, etc. 

In our present treatment we proceed as follows. We first derive the quantum 
mechanical Langevin equations for a single mode laser and indicate their 
solution and the physical consequences. We then sketch the derivation of the 
laser master equation, derive the exact generalized Fokker—Planck equation 
and discuss the approximations which reduce it eventually to Risken’s 
original equation. Finally we discuss its numerical solution and compare it 
with experiments. To readers who are not so much interested in the sophisti- 
cated quantum mechanical derivation of the Fokker—Planck equation we offer 
an alternative derivation in the framework of the purely classical theory 
(4.12.c). 
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4.2. THE HAMILTONIAN AND HEISENBERG EQUATIONS OF MOTION 
FOR THE LOSS-LESS SYSTEM 


To avoid unnecessary complications, we treat a single mode represented by a 
running wave, and a system of atoms with two levels and a homogeneously 
broadened line. Occasionally we shall indicate the changes involved when 
several modes are present, or when three-level atoms are used, or when the 
line is inhomogeneously broadened. 

The Hamiltonian of the total system (field and atoms) may be derived by 
the standard methods of quantum-electrodynamics (see for example, the 
chapter by Kibble in this volume). 


The field 
The vector potential A(x) of the electromagnetic field is decomposed into the 
““cavity”’ modes U,(x) thus, 


2nc*h 


Aw) = > / i+ 6,)U@ (197) 


where w, is the mode frequency in the ‘‘unloaded” cavity and b,' and b, are the 
creation and annihilation operators of the mode A. These operators obey the 
commutation relations 


b,b,'—b,* by = O44 
b,b,—b, b, =0, (198) 
b,t b,/—b,} bt = 0. 


The Hamiltonian of the field is given by 


Hy = dheo,b; b,. (199) 


In the following we retain only a single mode and drop the index 4. The 
Hamiltonian then reads 


H, = hob* b. (200) 


The matter 


In accordance with the foregoing sections, we consider a set of laser-active 
atoms, which we distinguish by an index p and which are embedded in a solid 
state matrix at random sites, x,. We treat explicitly only a single electron 
(‘‘Leucht-electron”’) of each atom. The electrons are described by the formal- 
ism of second quantization. The creation and annihilation operators of the 
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electrons of the atom y in the state j (= 0, 1, 2, ...) are, respectively, a,{ and 
a,;. These operators obey the anticommutation relations 

a,j! Ayy Fay y a,;' = Say Ojj"> 
By j Ay j + Ay i Ay; =O (201) 
a,} Ay; + Ay ;* a,; = 0. 


The Hamiltonian of the total atomic system reads 


H, = FE nj Fujt Ay js (202) 
HJ 


where E,,; is the energy of the level j of the atom uy. In the following we treat 
mainly two-level atoms with a homogeneously broadened line. Choosing 
E,, = Oand putting E,. = hv, we obtain 


H, = hv} ayo ay. (203) 
it 


The interaction Hamiltonian 


The Hamiltonian describing the interaction between a set of laser modes A 
and the atomic system yp is given by 


H;=h pa {(a,! as) b, Sua” + (a, a2), bt Guat (204) 
iy 


where 


(a, a1), = @,2' a,1, ete. 


The terms (a,' a,), b, describe the excitation of the atom yp connected with the 
absorption of a photon of the mode, whereas (a,' a2), b,' describes the inverse 
process. To obtain H; from usual quantum electrodynamics we made two 
approximations: 


1. we neglected terms quadratic in A (i.e. terms quadratic or bilinear in 
b,, b;'); 


2. we neglected antiresonant terms of the form a, a,t bt, which describe the 
simultaneous electronic excitation and creation of a photon and a,! a, b, 
which describe the simultaneous electronic recombination and annihila- 
tion of a photon. 
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The coupling constants g,,, are in the so-called dipole approximation identical 
with the ones given in eqn (121). When we consider only a single mode, we 
write 


ie {bY (as a.g.t +BY (ait a), an}: (205) 
iB H 


The total Hamiltonian 


This is given by 
H =H,+H,+H;. (206) 


The equations of motion 


For the laser theory it proved most fruitful to treat the Heisenberg operator 
equations, because they exhibit a close correspondence to the semiclassical 
equations (122)-(124). Using the relation 


dQ i 
—— 2 [HO 
a = y LO (207) 


for the time dependence of an arbitrary operator ©, we find for the creation 
operator b,' the equation 


bt = i@, b{+i> Gya* (a2 As) a: (208) 
B 


With respect to the electronic operators a, }, a, ;, it turned out advantageous to 
derive equations of motion for pairs (a} a,), rather than for the single opera- 
tors a,,! or a, ;. We obtained 


(a,' a1)", = iv,(a,t as) =f 3 Gua bt (a, a2—- a, Q1)y (209) 
Bu 


and a corresponding equation for the hermitian conjugate of (a,' a,),,. Because 
the operator (a,' a, —a,' a,),, which describes the atomic inversion, occurs in 
eqn (209), we have to derive a further equation for it. By letting 


Co, = (a,t a2 —a, Ay) ys (210) 


we obtain 


6, = 2i 3 (CPAGAY a2), b —8,,*(a,! 41), b,). (211) 
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4.3. THE HEISENBERG EQUATIONS OF MOTION INCLUDING DAMPING 


The quantum mechanical Langevin equations 


To compare these quantum mechanical equations with the semiclassical ones 
(122), (123) and (124), we note that by making the following changes 


bt—>b*;  ayh ay py23 Sy > dys (212) 
we see immediately that both sets of equations are identical apart from one 


basic difference, viz. the semiclassical equations contain damping terms as 
follows . 


(122): —Kb* (213) 
(123): —?P12 (214) 
do,—d, 

(124): = (215) 


These terms are not in the quantum mechanical equations (208), (209) and 
(211). In Section 3 we have seen, that just these damping constants determine 
qualitatively and even quantitatively the laser phenomena, threshold, in- 
tensity, actual frequency, etc. Thus the terms (213), (214) and (215) must be 
incorporated into eqns (208), (209) and (211). Let us try to do this first in 
eqn (208), where we put g = 0 for the time being. The solution of the equation 


bt = (ia—x) bt (216) 


bt = bt(0) exp[(iw—xk) #]. (217) 
Inserting this into one of the commutation relations (198) we find that 


[b, bt] = exp—2xt, (218) 


i.e. the commutation relations (198) are violated. 

The reason for the violation of the commutation relations can be found as 
follows. The damping of the field is caused by other heatbaths or reservoirs. 
For instance, the losses in the mirrors stem from the interaction of the light 
field with the electrons in the mirrors. Thus, in order to describe the damping 
properly, we must include these reservoirs in the Hamiltonian from the very 
beginning. When we now try to eliminate these heatbaths, it turns out that 
they lead not only to damping but also to additional fluctuations. These 
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fluctuations act as random forces on the field operator b', so that the quantum 
mechanically consistent equation must read 


bt = (io—x) bt + F1(1) (219) 


instead of (208). 

The forces Ft(t), F(t) derived by Senitzky“® are non-commuting operators 
which have quite general properties which do not depend on the special heat- 
bath used. In the appendix we show how the heatbath co-ordinates can be 
eliminated and how these quantum mechanical Langevin forces arise. In fact 
the analogy with a classical Brownian motion is quite close. The fluctuating 
forces F are of fundamental importance for the determination of the phase- 
and amplitude-fluctuations of the maser (microwave region), because they 
represent thermal noise. In the optical region the spontaneous emission noise 
plays the decisive role. This effect was first considered by Haken‘? in the 
following way. The eqns (209) and (211) alone are not sufficient to treat 
spontaneous emission by the Wigner—Weisskopf method, but must be supple- 
mented by the initial condition that the atom is in its upper state. In the case 
of the ammonia maser this “‘preparation’’ of the atoms is done when they are 
shot into the cavity at random times f,. In the case of fixed atoms the prepara- 
tion of the initial state is achieved by the pumping process. In a completely 
symmetric way the initial state is also prepared, for example, by the spon- 
taneous emission into the nonlasing modes (which form a continuum). 
Because eqn (209) is a first order differential equation, such initial conditions 
can be built in by adding on the right hand side a term 


T21(t) = py o(t = tuy){(aa' As) ul tay +t 0) _ (a,t As) ultay i 0)}, (220) 


where, for example, (a,' a,), (t,, +0) is the “value” of the operator immediately 
after the pumping process. One may show that the I’s are non-commuting 
operators with the property 


Ty =0, Taian oly, j, 2 2) = Gai, 5, ,56(4—-f), (221) 


where the 6-function expresses the fact that the external perturbations (“heat- 
baths” or “reservoirs”) which “prepare” the initial state have a short memory 
(compared to the other time-constants in the system), or in other words, that 
they are “‘Markoffian’’. 

Another, more rigorous, way is to couple the electrons of the single atoms 
to heatbaths which represent the pumping, the spontaneous decay into non- 
lasing modes, interaction with lattice vibrations, collisions etc. and to elimin- 
ate then these heatbath variables. This has been done for a two-level system in 
detail by Sauermann.“® Another method to derive the properties of the 
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coefficients G in equation (221) which was used by Haken and Weidlich©” 
relies on the requirement of quantum mechanical consistence. Summarizing 
their method, we first consider equations for the occupation number average, 


d 
at <az' a> = Wy2<ay' a,> —W21<a2' a>, (222) 
and the average for the dipole moment operators 


d 
dt <a,! a,> = (iv—-y) <a, a>, (223) 


for which we may write down phenomenological equations in the sense of the 
semiclassical treatment, where w,>, w2, are the atomic transition rates, v is the 
circular frequency of the atomic transitions, and y is the atomic half-width. 
The quantities 


a} a, = Pi (224) 


are projection operators which project from one state to another and which 
obey the following relation for projection operators in the single electron 
space 


Px Pin = Pie Sua (225) 


This relation is violated if the operators are forced to obey equations with 
damping constants. In order to save the property (225), we introduce fluctu- 
ating forces I(t) with the property (221) so that our fully quantum mechanical 
equations now read for the operators of the two level system: 


d/dt (a,* a.) = W,2.4,' ay —W 2, a2! a, + T2,(t); (226) 
d/dt (a,' ay) = —Wy2a,' ay +Woy a, a,+T,,(t); (227) 
d/dt (a,* a,) = (iv—y) a,t a,4+T2,(0). (228) 


The properties of the fluctuating forces [ are derived in Section 5 for an 
arbitrary quantum system coupled to Markoffian heatbaths. We quote a few 
examples: 


Gi2,21 = 2y(ay* ay> + ao" az>) 


G2, 12° 0, G21, 20> 0. 
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In particular it turns out that the relations remain unaltered, if the action of 
external fields is admitted. Thus we can combine the form of eqns (226), (227) 
and (228) with those containing the coherent interaction: 


bt = (i@,—K,) by +iY gi@r' ay), t FiO) (229) 
Bu 


(a,' a),° = (iv,—y)(a2" ay:),-i>d, Gpa b,*(a,' a,—a," Qy)utT21,,(t) 
4 (230) 


. dy-o , 
G, = Saar py (s.aCas" a2), b,' —gi,(a,' a1)y b,] +1, (231) 


where 
W394 3. 


1 
WigtW2, ~T 


dy = =WiytwWa; T= T22-Vis- 


4.4. THE QUANTUM MECHANICAL LANGEVIN EQUATIONS FOR A SET OF FIELD 
MODES AND A SYSTEM OF N LEVEL ATOMS. CONNECTION WITH THE 
SEMICLASSICAL EQUATIONST 

The methods given in Section 5 enable quantum mechanically consistent 

equations for an arbitrary quantum system to be derived. In the case of the 

laser, the corresponding equations are: 


a. Quantum mechanical equations 
a. The field equations 


* b,t = (ia, —K,) b,' +i ¥ Ginna" a),+F a(t), (232) 
where b,‘ is the creation operator of the cavity mode A, w, its frequency, and 
t, = 4x, its lifetime (both in the unloaded cavity). (a; a,), is an abbreviation 
for a,,' a; where a,,' is the creation operator of an electron in the level j at an 
atom p (4 denotes, for instance, the lattice site) and a,,, is correspondingly the 
annihilation operator. gj, describes the coupling of the mode / with the 
electron at atom y, which causes a real or virtual transition between levels / 
and j. The fluctuating forces F,(t) are characterized by the properties 


CF A(t) Fy(t')> = 2x, 0,(T)6(t—-t’) bay 
CF A(t) Fatt) = 2x, (1,(T) +1) 5-0) buy 
CF, Fy) = <F,' Fy") = (Fy) = CF, = 0. 
n,(T) is the number of thermal quanta at the temperature T. 


+ The reading of this section is not required for the subsequent ones. 
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(B) The matter equations 


d 
at (a,* ay) = Wi (a;* ay) a : > Gjtpa (b, +b,){(a;' ay), 6; po (a,* ay) Sy} 
—yj-p(a;* ay), 1 —dpy) +67 {x (Way (ae any 


—(a;* ay), W;- »| +Tpy,(t) (233) 


E, is the energy of an electron in the level j at atom yp. The inhomogeneous 
broadening is taken into account by letting the E’s explicitly depend on y. 
6,; is Kronecker’s symbol (= 1 for 1 = j; = 0 for 1 4 j). The Is are the 
fluctuating forces. It is assumed that they are uncorrelated for different y’s 
and have the properties (221) where the coefficients in (221) are given by (490). 
T he semiclassical equations. If we take the average with respect to the heatbath 
variables which still occur in the F’s and I’s, the fluctuating forces vanish. 
We take further traces with respect to the electronic states and the light field 
and make the approximation that © 


thotai(O, a} a; Prot) = trieia(Oa Prieta) tr,(a,' Gq; Patom) (234) 


Prov Pricia 20d Patom are the density matrices of the total system, of the field 
and of the atoms, respectively. Then, using 


tr.,(a," qj Pasa) a Pytn,tttiera(Oa Prieta) = <5), (235) 


we obtain finally from (232) and (233) the so-called “semiclassical” density 
matrix equations. 


b. “Semiclassical” density matrix equations©» 


When we specialize the above equations to the four level case and apply 
the rotating wave approximation, we obtain the equations (127)-(132) with 


ay . eae * 
Girpa = Gnas Jripa — Gua . 


4.5 QUALITATIVE DISCUSSION OF SINGLE MODE OPERATION(?°°53) 


Some transformations 

A series of implications of the above equations can be explained by the 
following specialization. We consider a system of two-level atoms with a 
single transition frequency v, which are in resonance with a single, running 
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cavity mode. The equations of motion then are 


bt = (ia—x) bt +i¥ g exp (—ikx,) (a,t a), + F', (236) 
“ 


(a,' a1)" = (iw —y)(a,* @1),—ig exp (ikx,) bt(a,! a2 ~a,t Qy),+ Vat. (237) 
together with the complex conjugate equation, and further 
(a,! a2),° = (a,* ai) Wi2- (a,' a2), Wat +i(a,t Ar) In bt 
—i(az* ayy Gu* b+ 22, (238) 
(a,' Ay) y" = (a, a2), Wai (a, Q1)y Wi2— i(a,* a2), Gu bt 
+i(ay’ ay), 9,7 64+T 11, (239) 


We eliminate @ and the factors exp (ikx,) and exp (—ikx,) from the above 
equations by the transformations 


bt bt exp (iat), b->b exp (iat), (240) 


(a2! a,),—>exp (ikx,) exp (iat) (a,* a1), 


(a,' a), exp (—ikx,) exp (—iot) (a,' a2), a)) 


We then put Ft = Ftexp (iwt)andI,, = exp (iwt) f',,. Because the equation 
a,‘ a, + a,' a, = 1 holds exactly, we introduce the new variables 
aj'a,+a,ta, and (a,'a,—a,'a,) =9,. (242) 


The eqns (236)-(239) then take the very simple form 


bt = —Kbtig ¥ (a, a,),+ Ft, (243) 
LB 
(a,! ay), = —y(a,t a,),—igb' OntP arp (244) 
d,— 

é,= mH 2ig(ay! a, ba) a3), hE s (245) 

where reer 
rm miae 1 (246) 

Wr2t+Wai 

and (1/T) = (wy. +21) (247) 


and T is the relaxation time of the atomic inversion. 
Because the mode interacts with the atoms through a sum, we are led to 
introduce )° (a,/ a,),, as a new variable S*t and correspondingly to put 
“ 


S=Yo,. (248) 
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The eqns (243)—(245) thus simplify (by summation over p in (244) and (245)) 
to give 


bt = —xbt+igSt+ Fr, (249) 
St = —yS —igbt S+T,, (250) 


T,, = Y Boag 
“ 


where 


. SoS 
S= = + 2ig(S~ bt —bS*)+T2.—-T yy, (251) 


ly, = d Dyins 


So = Nd. 


(252) 


and N is the total number of atoms. The variable S* can be visualized as the 
macroscopic, total dipole moment of all atoms, while S = (N,—N,) is the 
total occupation difference (total inversion). S*, S~ and S can also be 
interpreted as operators for the total spin 


S= (S,., Ds S.); 
where 
St= S,+iS,, S”~ = S,—iS,, S = 2S.. 


From the first two equations we eliminate S* and we substitute S* in eqn (251) 
by means of eqn (249) and S~ by means of an equation which is the complex 
conjugate of eqn (249). This leaves us with 


oa bt+ (x+y) bt +(xy—g? S) bt = ig f+ Ft+yFt, (253) 
where we substitute ; 

igf,,+Ft+yFt = F,,,' (0), 
a) § = Sox _ aot by 420 bt b— (Ot F+ Ft) 40 y—T yy. 
(248) T (354) 


a. The linear range (subthreshold region) 


For an interpretation of eqns (253) and (254) we treat b‘ as a classical, 
complex, time dependent variable, which we decompose into its real and 
imaginary parts 


bi(t) = x(t) +iy(0). (255) 
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Equation (253) can then also be decomposed into a real and an imaginary 
part. The resulting equations then have the same appearance as those of a 
classical particle, moving in two dimensions under a force 


RK, = —(ky—g? S(x,y))x, Ky = —(xy—-g? S(x, y)) ». 


Although the motion of the light mode has, of course, nothing to do with a 
particle motion, this formal analogy is most useful for an understanding of the 
nonlinear eqns (253) and (254). Consider first the case, that the inversion 
S = (N,—N,) is kept fixed, which is actually done in the linear theories of 
laser noise, or in the amplifier range. Let us start with a small inversion, so 
that 


ky—g?S>0. (256) 


Equation (253) is then that of a damped harmonic oscillator, subjected to a 
random force K(t). This random force can be interpreted in such a way, that 
it exerts pushes at a random time sequence. After each such excitation the 
“particle falls down the potential curve” (compare Fig. 4). When we increase 


v(x) 


Fic. 4 


the inversion S, (xy—g? S) becomes smaller so that the particle comes down 
the potential curve more slowly. Translating this statement into the language 
of waves, we may say, that the modulation frequency, caused by the fluctua- 
tions, becomes smaller, so that a line narrowing results. The pushes in F,,,, (¢) 
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stem from statistically independent events. Consider for instance I’,, which is 
equal the sum of the atomic noise operators T’,,,,. Each operator is respon- 
sible for the fluctuations of the atomic dipole moment, which are indeed 
statistically independent. As we will see later explicitly, ',, is responsible for 
the spontaneous emission noise, while F accounts for the thermal noise of the 
walls and the vacuum fluctuations. Each emission process creates a contri- 
bution to bt. Due to the flatter curve the amplitude falls down more slowly, or 
in other words, the losses are partly compensated by induced emission. Be- 
cause the total amplitude bt is a sum of many statistical contributions, the 
probability P of finding a definite value of bt is a Gaussian, 


P(b) = const exp(—|b|?/|bol)- (257) 


When S becomes so big, that xy—g?S > 0, the system seems to become 
unstable (compare Fig. 4). In the linear theories, S can never become bigger 
than S, = xy/g?, because otherwise the light output ~|b|? would become 
infinite. These latter considerations are, however, incorrect, because they 
neglect the dependence of S on b' completely. 


b. The nonlinear range (at threshold and somewhat above) 


Our discussion is based on eqns (253) and (254) where we investigate 
now the consequence of the dependence of S on 5. On account of the line 
narrowing we may expect that the frequencies inherent in (b* b) are smaller 
than x so that we neglect (b' b): compared with 2x bt b. Because the atomic 
relaxation time T is also small compared to times inherent in (b' 5) (again on 
account of the line narrowing) S can follow the motion of b! b adiabatically. 
Neglecting the fluctuating terms in (254) we thus obtain S ~ Sy — 4k Tbt bso 
that (253) takes now the form 


bt+(«+y) bt + (xy—g? Sp) bt +4g? xTHt bbt = F,,, (¢), (258) 
which can be understood as the equation of a particle moving in the potential 


2 

VilbI) = (ey —g? So) + g? «TIBI. (259) 
For xy — g? S > 0, i.e. small inversion, the potential curve has qualitatively 
the same appearance as discussed under (a). For ky — g? S < Othe amplitude 
oscillates around a new stable value r7 ¥ 0. Because the potential has rotation 
symmetry, there is no restoring force in a tangential direction. Thus the phase 
undergoes some kind of diffusion under the action of the random force 
F,,.(t). We write 


bt(t) = expLid(t)] (rot pC?) (260) 


where ro is the stable value of the mode amplitude and #(t) is the fluctuating 
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phase of the mode, and discuss the phase diffusion and the amplitude fluctua- 
tion p in more detail. 


a Phase diffusion. With increasing S the stable value ro, of the mode amplitude 
increases. Consequently the path of the particle, and thus its migration time, 
increases. Because in a diffusion process we have Ax? oc t, and fora tangential 
motion Ax = ry Ad we have 


(Ag)? & t/ro’. (261) 


The factor of ¢ (time), however, is just the linewidth. With increasing inversion, 
the linewidth Av due to phase diffusion is proportional to 1/r9” and, because 
the coherent output power, P, is h@2xry”, we see that 


Ay o 1/P. (262) 
B Amplitude (intensity) fluctuations. With increasing ro, the slope of the 
potential curve becomes steeper in the vicinity of ro. Therefore: 


1. the oscillation frequency increases, so that the relaxation time of the 
amplitude fluctuation decreases and t oc 1/P; 


2. the fluctuating part, p, of the amplitude decreases, so 
<p?> x 1/P. (263) 
c. The nonlinear range at high inversion 
With increasing inversion the oscillation frequency of the amplitude 
fluctuations becomes finally bigger than 2x, so that the dominant terms in 
(254) are given by 


So —2(bt b): (264) 


So a S 
T 
(the terms with F and I',,—I,, are not completely negligible, however) 

which, for rapid oscillation has the solution 
S = const — 25" b. (265) 
Thus the behaviour is qualitatively similar to that of the region (b). 
d. Exact elimination of all atomic coordinates 
For the qualitative discussion we have expressed S only approximately by 
means of bt, b. However, this can be done exactly, 


S= dy—2] exp (-5¢-») {(bt b) +.2«(bt b)— (bt F+ Fb) 


—4(T22-Ty,)} de. (266) 
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Inserting (266) into (253) yields an exact equation for the b-field alone. 


e. The quantum mechanical van der Pol oscillator 


As we have just discussed, in a region not too far away from laser threshold, 
the actual relaxation times of the field are smaller than the decay constants x 
and y. This allows us to neglect the term bt compared to (x + y) bt in eqn 
(253). Inserting S = Sy — 4xTb' b into (253), and neglecting the cavity half- 
width x compared to the atomic half-width y we obtain the equation 


~ 


bt+(e-Ls 42 act nee aR (267) 
y 0 y a y tot ° 


The equation for the operator bt may be considered as the quantum mechanical 
analogue of the van der Pol equation in the rotating wave approximation with 
a noisy driving force (1/y) F,,, (t). In the following Section 4.6 we solve the 
exact eqns (253) and (254) rather than the approximate eqn (267) by the 
method of quantum mechanical quasilinearization, because no substantial 
additional difficulties arise. On the other hand the final form of the Fokker— 
Planck equation near laser threshold can be obtained in a more or less 
heuristic way directly from (267), if this equation is interpreted as a classical 
(c-number) equation. We shall follow up this line in Section (4.12.c) so that 
the reader may proceed after having read Section 4.6 right away to Section 
(4.12.c), unless he is interested in the rigorous quantum mechanical treatment. 


4.6. QUANTITATIVE TREATMENT OF SINGLE MODE OPERATION 
(QUANTUM MECHANICAL QUASI-LINEARIZATION)°?°?>*) 


Somewhat above threshold the eqns (253) and (254) can be linearized by 
the ansatz (260), keeping only terms linear in p and ¢. Note that in the quantum 
mechanical treatment p and ¢ are still operators while rp is a c-number (in 
these lectures we refrain from a representation of the precautions which this 
operator treatment requires). We first eliminate S in eqn (254) by inserting the 
expression (266) into eqn (253). Then we introduce the ansatz (260) and split 
the resulting equation for ro, p and ¢ into its “real” and “‘imaginary”’ parts. 
In zeroth order, i.e. no fluctuations present, we obtain an equation for ro, 
which is identical with that of the photon number derived in the rate equation 
treatment, see (21) with the spontaneous emission rate W = g?/y. The terms 
linear in the fluctuations give rise to the phase-fluctuations (imaginary part) 
and the amplitude fluctuations (real part). 

If the laser mode is in resonance with the atomic transition frequency, the 
equations for p and @ are uncoupled, so that we may discuss these equations 
separately. 
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Phase noise, linewidth formula 
The equation for ¢ reads 


b+ (ety) ¢O= —Imfex(—is)(yFt + +ig Ta}. (268) 


The factor exp(—i¢) represents a strong nonlinearity. It can be absorbed into 
the fluctuating forces because, as can be shown, the fluctuating forces can be 
represented by random pushes with random phases. Thus exp(id) can be 
combined to a new random phase factor exp[i(¢+¢,). Equation (268) is 
readily integrated. As we will show at the end of this paragraph the linewidth 
(half-width at half-power) due to phase noise is given by 


Av => (GO-8O)), [> O40] (269) 


where the average refers to all coordinates of the noise sources. Using the 
properties of the noise sources F, T one obtains 


1 
= FF ea)? {497 (Ny +N2)+ky? (4 +4)}; (270) 


where N, = N<p2.>,N, = N<p;1;> is the mean occupation number of levels 2 
and 1, respectively, of all atoms and n,, is the thermal photon number. We 
eliminate the coupling constant g, which specializes in the present case of an 
exact resonance to the equation 


g? = Ky/(N2—-N;), (271) 


(note that N, is the actual occupation number under the joint action of “heat- 
baths” and laser field). Thus we find, finally, for the linewidth (halfwidth at 
half power) 


447 Oh 1 No+N 1 
°K >| 2tN, |, (272) 


AV= Cory? PLD N,N, ete 
where P = 2xr*hqw is the output power, is the mode frequency, y and x are 
the atomic and the cavity halfwidth, respectively. 

The term (N,+N,)/2(N,—N,) stems from the atomic noise sources and 
expresses spontaneous emission noise. The term n,,+4 arises from thermal 
resonator noise and vacuum fluctuations and stems from the field noise source 
F(t). It was soon recognized by Shimoda ef al. that both thermal noise quanta 
and spontaneous emission noise quanta should occur in a symmetric manner 
in Ay. This is now achieved by using the complete set of noise sources. In the 
maser, the thermal number of photons n,, is much bigger than unity (because 
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kT >hq) so that the thermal noise of the resonator determines the linewidth. 
In the laser this contribution is entirely negligible and (N.+N,)/(N2—N,)>1 
is dominant. In a two-level system it is many orders of magnitude bigger than 
the other two terms. The expressions for the linewidth above and below 
threshold differ by a factor 4 in the quantum mechanical case as they do in the 
classical case. 

The lineshape is a Lorentzian which is cut off at frequencies ~ x + y. In 
order to derive the lineshape as well as the relation (269) we consider the 
coherence function 


<b (t) (0)>, (273) 
where exp(iot) is split off. We insert 
bt = ro exp[id(t)] (274) 


into (273) and use the fact that both T( = )'T,) and F consists of many 
statistically (or quantum mechanically) independent systems (i.e. they are 
Gaussian), so that 


$(t)—¢(0) = Lt (275) 
<bt(t) b(0)> © ro? [] <explid, (> © ro? [] {1 +iXG,02)> - 46,0) }- 
g . (276 


where ¢ is the fluctuating phase of the mode and @y is the initial value of the 
phase. Because 


<b,» = 0 (277) 
and thus 
¥ <¢,?> = ((@- G0)”; (278) 
we have finally 
<bt(t) b(0)> = ro? exp{—4<(6—$o)”>}. (279) 


The explicit evaluation of (278), which we performed above, has shown that 
the quantity <(¢—¢ )”> is proportional to the time f. On the other hand we 
should expect that for a classical electric field with a Lorentzian line-shape we 
have E(t) oc exp(— Avt) where Av is the half-width at half power. Thus the 
laser line-shape is indeed a Lorentzian. The above line-width formula holds in 
conditions well above threshold, i.e. 


<p> < ro’, 


where p is an amplitude fluctuation. 
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Amplitude fluctuations 


We start again with the linearized equation where we compare now the 
“real’’ parts of both sides (again after multiplication with exp [— i¢(t)]). This 
yields the following equation 


t 
P+ (e+y) p—2lgl? ro? | (6-+2xp) K,(t,1) de 


t 
= -lal?r0| | (ro (P+ FI) 4322-1) Kilt] 
+ Refexp(—id)(yFt + Ft +ig* f,,}, (280) 
where K, (t,t) = —2 exp[—(¢—1)/T] is the integral kernel which stems from 


the elimination of S in (253) and (254). For the solution we perform a 
Fourier-transformation 


p= | - exp(icot) p(a) do. (281) 


We then obtain readily for the Fourier-transform of eqn (280) 


p(@) N(@) = M(@), (282) 
where 
N(w) = {-@? + i0(K-+y)—2g? ro(iw + 2k) K,(o)} (283) 
with 2 
K,(@) = — (284) 


Wi2+Wo1 tio 


The polynomial M(q) is of a complicated structure, so that we drop this 
intermediate result. If N(@) possesses one or several real roots «,, then the 
modulation of p of |b] = r9+ p shows an undamped oscillation (“spiking”’). 
A closer discussion of N(w) shows that a necessary condition for undamped 
spiking is 

yt4wr4,+W12 < K, (285) 


which holds only in exceptional cases however. For the experimentally ob- 
served undamped spiking, other mechanisms must therefore also be invoked. 
Because the calculation of the correlation functions of the amplitude fluctua- 
tions is elementary, but lengthy, we only quote the final result 


ee q G(@) exp(iat) 


-0@=ay? 89 


Ko) ale+1)) = 5 | 
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where G(w) is a polynomial of w of fourth order, while the h’s are polynomials 
of third order. For moderate photon numbers, i.e. 


rok < Wijr?s (287) 
the expression for the intensity fluctuations is found to be 
k(t) = <bt(t) bt(¢+1) b(t +1) b(t)> — <b" by, CBT bras 


1/1 dine to” 
ts 2 1 thr ‘0 2 iy d ae 
2 ( tne + hat do =u Aine exp[ KT( 0 tne)/ thrl> ( 8 ) 


Ii 


where d,,, is the (atomic) inversion at threshold, dp is the unsaturated inversion 
which one would obtain for a given pump rate without laser action. Formula 
(288) holds also for inhomogeneously broadened solid state and gas lasers°” 
in a region above threshold which starts with a photon number ro? which 
equals 5-10 times the photon number at threshold up to photon numbers 
satisfying (287). Because dy)—d,,, is proportional to the photon number re 
the frequency of the intensity fluctuations increases with ro’, whereas (286) 
decreases according to 1/r,?. Both predictions are fully substantiated experi- 
mentally in a region rp? > 10 X (ro7)ene- 


4.7. THE DENSITY MATRIX EQUATION FOR A SET OF 2-LEVEL 
ATOMS AND A FIELD MODE‘>>? 


In this chapter we sketch the method by which to derive the density matrix 
equation and write it down explicitly for 2-level atoms and a single mode. 
The reader, who is interested in the general derivation of the density matrix 
equation for an arbitrary system is referred to in Section 5. 

We consider the interaction of the light mode or the electron of an atom 
with a heatbath or a set of heatbaths. The density matrix W of the total 
system obeys the equationt 


dw i 
<= — 1H, WI, - 289) 
where 
H = AotHyreg (290) 
and 
Hy = H,+H gz, | (291) 


¥ In this section we abbreviate H (field-bath) and H (electron-bath) by Hrg, Hiriela) OF 
(electron) by Hr and H(bath) by Hz. 
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The Hamiltonian of the free field is H, and is 
hab' b light mode (292) 
= | hva,* a, atom (293) 


The Hamiltonian of the heatbath H, may consist of a sum of several 
Hamiltonians corresponding to different heatbaths 


H,z= pa H,;”. (294) 
J 
Hy, represents the interaction between the two systems. For our analysis it is 
convenient to proceed to the interaction representation 
W = exp(iHot/h) W exp[(—iH t/h)] (295) 
Fiz, = exp(iHy t/h) Hyg exp(—iHo t/h). (296) 
The interaction Hamiltonian has the form 


be Bt(t) + bre’ Bit) light mode 
V1 Bi V2 B2 (297)t 


a,ta,e™ B,,(t) + a,ta,e7™ By, (t) electron 
Vi Bi V2 B2 


App =h) V, (t) By (t) = | 


where the V’s are field or electron operator’s and the B’s are bath operators. 
The explicit form of the V’s suggests that we write 


V(t) = V, (0) exp(iAv, 2). 


The idea for the derivation of the density matrix is now this. We solve 
eqn (289) in the interaction representation by perturbation theory up to 
second order 


W = Wo+(—s-) [ae (Aen, wolt(-z) [oar f” dt, {...} (299) 


{ cee } = [Ares (2), [Are (t1), Wo]. 


W, is the density matrix of the total system at time t = 0. 
We then take, on both sides, the average over the heatbaths which are sup- 
posed to be kept at certain given temperatures. Assuming that the average 


+ Here and in the following we include only heatbaths causing real transitions. 
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over the heatbath variables vanishes, i.e. B = 0, we obtain the following result 
i Re i 2 t t2 
p= %+(-7) { ac, | dt, { ..}, (300) 
0 0 
{ Pavers } = > [Vi(t2) B,(t2), [V,-(74) By(11)s Wo]. 
where; _ * 
W= 6) 


and p, fo ate the density matrices of the system averaged over reservoir 
variables. 
We now assume that the heatbaths have a short memory. This implies that 


B(t2) B(ts) ~ 6(t2 —T1), (301) 


so that the double integral reduces to a single integral which may be approxi- 
mated by the product of the integrand and the time ¢ if ¢ is small enough. It 
must be noted that the remaining operators V, still act on the density matrix po. 

A closer inspection shows that the resulting right-hand side can be written 
in the form of commutators (see below). We now bring fo to the left-hand side 
and divide both sides by ft. Taking the limit of small t we may put 


(6—po)lt = dpjat. (302) 


Assuming further that we consecutively repeat the step just done, we may 
insert an arbitrary time into fo. This leaves us with the equation 


dp(t) 
dt 


SF » {Ver b(t), Vil Aner + Vi» CO) Vil A’ xx} (303) 


The coefficients A, A’ contain the correlation functions of the heatbath 
variables. A detailed discussion shows that only those terms k, k’ contribute 
for which Av, = — Av,,, so that we use only those terms further on. 


Specialization of eqn (303)°° 
a. Light mode. We identify V, = b exp(—iot) and V, = bt exp(iat). Using 
new abbreviations for the coefficients A, the density matrix equation reads 


d 

HS = 5(Cb" 9, b]+ 10", pb) + (bp, BY1+Lb, b'D. 304 
We first show how this equation allows us to treat the mean motion of the 
light field amplitude and of the intensity, where we define the average of an 
arbitrary operator O by 


<O) = tr(Op). (305) 
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The average lightfield amplitude obeys the equation 
d<bty 
dt 


where 6 and € are constants. 
The photon number obeys the equation 


dt b> 
dt 


= {io—(f—4)}<b'». (306) 


= 26—2(€-—6)<bt b>. (307) 


Integrating (306), we get 
<bt> = <bt)» exp[iwt ~ (€—6)]. (308) 
The solution of (307) is 


r) 
<b' by = <bt yy exp[—2(E—5)r] +er3 (1 —exp[—2(€—6)t]). (309) 
Evidently the difference —5 must be identified with the decay constant which 
is introduced in the classical theory 
€-d =k, (310) 


whereas the expressions 6/(£—65) must be identified with the photon number 
n(T) at the equilibrium temperature, T. 


b. Atom. We make the following identifications in (298) 
V, = a,' a, exp(ivt), V2 = a,' a, exp(—ivt). 
The density matrix equation is 
dé w 7 » 
ae ar {[a,' a2, pazt a,]+[a,' a, p, a,t a;}} 


Ww ~~ ~ 
to {[a," a,, pa, a,}+[a,t a; p, a,* aj}. (311) 


Using the notation w.,, wy, instead of A,,, A’, we have anticipated that 
these quantities are just the transition rates used earlier. To demonstrate this 
we investigate the rate of change of the average of occupation numbers, 


<a," a2) = tr(a,' arp) (312) 
d dp 
qe" a2) = (a: ay +). (313) 


Inserting for df/dt on the right-hand side, using the cyclic property of traces, 


i 
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and using the relation a;' a, . a; dq = 54; 4;' G_ (for one electron) we readily 
obtain 


d 
di a2‘ a2) = W42<a," A, >— W21<a2' a2). (314) 
In the same way we find 


d 
at a,> = — 4 (Wy. +W21)<a,' a2), (315) 


so that in the present model we have 
2y = Wi2+W 1. (316) 


c. Field mode coupled to many atoms. It is a simple matter to extend the above 
results, a and b, to the joint system of a field mode and many atoms. One 
may show that the expressions for (df/dt);.14 and (d/df),19m just add to each 
other. Returning from the interaction representation to the Schrédinger 
picture and introducing the coherent interaction Hamiltonian (204) we find 


dp i Op (2) 
eS ae — 317 
dt h [Hr p] - ( or Vout ® Ot atom = ( ) 


where Hy, and (@p/6t),;.14 are given by (206) and (304) respectively. The 
right-hand side of eqn (311) gives (69/81) 15m, provided the operators a,t a; 
are supplemented by the atomic index yp. 


4.8. THE EVALUATION OF MULTI-TIME CORRELATION FUNCTIONS 
BY THE SINGLE TIME DENSITY MATRIX”? 


We shall show that the density matrix which contains only one time variable 
may be used to calculate multi-time correlation functions. It is known (see 
Glauber’s chapter in this volume) that the statistical properties (or, in other 
words, the coherence properties) of laser light are determined by correlation 
functions of the form 


CBE (ty)... BN ty’) B(t,) - - (41) (318) 


where we consider a single mode for simplicity. However, all our following 
considerations apply equally well to the multimode case. We assume, that the 
b’s occur ordered with respect to their time 


har 2&3 Mig Sy (319) 
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In order to show how (318) can be calculated by the density matrix, we rep- 
resent (318) in the form 


tr{b(t,) B(ty—1) «+» « b(t1) (to) BT ty’). OM tm’)}; (320) 


where p(to) is the density matrix at time ft) = min (tf, t;). We can rewrite 
expression (320) in a new fashion by means of functional derivatives, 6, 


iy F 
where 


p(a, y) = (1 exp| -if H, ade—i[” (a(t) b, + 0*(t) b,*) dt 


1 1 5” 5” 
Feorcanmecoo By) yg) POM Moan (321) 


y= 


(322) 
x p(to) Texp if H, dt’ +if (y(t’) by +y*(t') by") de’ ) 


reservoir 
Here H,, and H, are Hamiltonians at times t, t’, and «, y are arbitrary 
functions. The time ordering operators, T and T order the terms with latest 
time to the left and to the right, respectively. 

The total Hamiltonian, H, of the system comprises the light field and its 
interaction with the atoms, the interaction between the light field and heat- 
baths, and the interaction between the atoms and the heatbaths, thus we have 


H = Hs+Hs_- >», 
where Hyg = Heieia t+ atoms +A ieta-atom (323) 
and As—p = Agieta-bath + Hatoms-path 
It is assumed that the free motion of the heatbaths is split off, so that H, 
contains the coordinates of the heatbaths in the interaction representation. 
The problem is now reduced to the determination of the extended density 


matrix (322). By differentiation of both sides of (322) with respect to time we 
obtain an equation for p(a, ). 


dp(a, ; j 
SP). iH, 1 — i(alt) b+ arb") p+ pi +7") ay 


a iKLH sp; Pp] Pres" 


The right-hand side of (322) defines but without the reservoir average. 
The last term calculated from the above, coincides with 


OF esl an) 
= + | — 
( ot incoh, light ot incoh. atom 
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on the right-hand side of eqn (317). Therefore, the only difference between the 
usual density matrix and the new equation (324) consists in the additional 
terms containing the «’s and y’s which are arbitrary functions (which may be 
small). In order to determine the correlation function (318), the system must 
be “‘probed” by classical external fields.t The 5’s in (324) are, of course, time 
independent. If we want to calculate only a limited number of correlation 
functions, it is sufficient to formulate an initial value problem. For this we put 


a(t) = >) a; d(t—t;), yo) = > yi" O(t—t,) (325) 


and replace the functional derivative by an ordinary derivative with respect to 
a. Here t, is the time at which the jump occurs. Obviously the solution of (324) 
is now replaced by solving the usual density matrix equation (317) with the 
additional jump conditions given by 


dp 


tite 
< _ = Plt 0)— p(t, 0) = —indf Bt" —1) plt') dt" (x96 


= —iabp(t;—0)-+correction terms of order «” 


The corrections of order «? and higher order may be neglected as long as the 
differentiation in eqn (326) is performed only once with respect to time at a 
given time. The case that several operators occur in (318) at the same time may 
be then obtained as a special case by letting the time difference go to zero. 

So far we have been considering correlation functions with respect to 
photon operators. In a similar way, we may do this, of course, with atomic 
operators or with mixed terms, for instance, with 


(G54 gy Ey") «© + Gig Beg tm’ Gi," ay, (tn) « «« (Gi,' ag, (t1)>, (327) 


where the a,’ and a, are the creation and annihilation operators of electrons. 
Obviously one has now a density matrix equation of the form 


dp ’ : 
ar = —iLHs, p]+ Ay p+ pAz—iy Dyu(t) a" ay P 


+ pi >, D' y(t) a;t a, —i(a(t) b+ a(t) bf) pt+ip(y(t) b+y*(t) bt) (328) 
ik 


and the correlation function (327) may be found by functional derivatives 
with respect to the coefficients D. 


+ This establishes a connection with Kubo’s point of view.6® 
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4.9. GENERALIZED FOKKER—PLANCK EQUATION; DEFINITION 
OF DISTRIBUTION FUNCTIONS 


a. For the field 


(i) Wigner distribution function and related representations. The Wigner distri- 
bution function ©* establishes a connection between the quantum mechanical 
wave function, or more generally, the density matrix, and a c-number distri- 
bution function of (classical) variables (uf). The Wigner function is 


f (e.%) i | exp(—in, B, —iu, Bp) tr{exp(iB,x+i B2p)p} 4B, dB,, (329) 


where x and p are coordinate and momentum operators,} respectively, of a 
harmonic oscillator, and p is the density matrix. 

Equation (329) establishes a connection between the quantum mechanical 
variables x and p and classical variables u,, u: 


classical transformation (329) quantum mechanical 
uy; - > x 
h d (330) 
Ms aa PT 


Equation (329) corresponds to the definition of the distribution function in 
probability theory of classical variables where x and p would be random 
functions and tr[exp (if, x + iB, p) p] would be just the characteristic func- 
tion. The definition is not confined to an harmonic oscillator, but can also be 
used for other quantum systems. In the following we leave it open whether p 
or (x, p) are time dependent operators, because (329) holds for both the 
Schrédinger and Heisenberg representation. 

In our present context it is advantageous to proceed from (x, p) of the 
harmonic oscillator to the corresponding Bose creation and annihilation 
operators b', b by the equations 


x= (or +0), [— 


(331) 
p= iot—b) | 
2 
Inserting (331) into (329) yields 
| exp(— ifu—i B*u*) tr{exp(iBb + iB*b‘)p} d7B, (332) 


+ x and p are defined in such a way that the Hamilton operator becomes H=4}({p? +a? q?). 
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where we have introduced 
h(B, 4 Ou, +i, 
Ve 3 sl" Ba) 4 = ~"/(Qeh) (333) 
d?8 = d(Re f) d(Im f) 
Using Feynman’s disentangling techniques” we obtain 


exp(ifb-+ ip* bt) = exp(|f|7/2) exp(iBb) exp(ip* b*) (334) 


The sequence of the operators exp(iBb) and exp(if*b') is rather unimportant 
because we have 


exp(if*b') exp(ifb) = exp(iBb) exp(iB*b') exp ||? (335) 


There are thus two equivalent forms of (332): 


1 
W (u, u*) = =| exp(—iBu— ip*u*) tr{exp(iBb) exp(ip*b') p} exp(|B|7/2)d7B 
. (336) 


1 i. 
= a | exp(—ifu—iBtur) tr{exp(iB*b) exp( 6) p} exp(— |pI"/2)428 
(337) 


where the Wigner distribution function, W(u, u*), is now normalized with 
respect to the complex u-plane. 

Somewhat different definitionst for distribution functions are obtained 
by omitting the factors exp(+|B7|/2) 


P(u, u*) = - in] exp(—iBu—ip*u*)tr{exp(ip*b*) exp(ifb) p}d*B (338a) 


QO(u, u*) = + | J exp(—ifu—iptur) tr{exp(iBb) exp(iB*b") p}d?B. — (338b) 


Introducing coherent states |u> by the equation 

blu> = ulv, (339) 
we can prove that P(u, u*) is the Glauber and Sudarshan‘°” diagonal repre- 
sentation of the density operator p, given by 


p= [Po u*)|u> < uld?u, (340) 


+ All definitions of W, P, Q in this section secure that W, P, Q are real provided P is a hermi- 
tian operator. Note, that P, W are not necessarily positive, however. Therefore P, W do not 
have the usual meaning of a probability distribution, but they allow us to calculate all 
quantum mechanical expectation values purely by c-number procedures (see below). 
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whereas Q(u, u*) is the diagonal matrix element of the density operator in 
coherent states |u>: 


O(u, ut) = — <ulplu. (341) 


For proof of (340) we insert this representation into the right-hand side of 
(338a) and use the cyclic property of the trace tr(AB) = tr(BA). We then get 


z| J J | P(u', u’™) exp[—iB(u—u')— ip*(u* —u'*)] du’ d?B 


which is equal to P(u, u*). To prove (341) we insert the coherent state rep- 
resentation of the unity operator 


1 = -| Ju> <ul d2u 


within the trace, leading to 
1 
tr{exp(iBb) exp(iprot)— | |u> <u| d?u P| aoe i exp(ipu + ip*u*)<ul plu>d7u 


Inserting this expression at the right-hand side of (339) and making use of the 
6-function representation we immediately get (341). Due to the close relation- 
ship of the functions W, P,Q there exist equations connecting W, P, Q that 
read, according to Glauber, 


O(u, u*) = =| exp(—2|u— |?) W(a, «*) da 
W(u,u*) = ={ exp(—2|u— |?) P(a, a*) da 


O(u, u*) = = | exn(- |u— a”) P(a, a*) d2a. 


(ii) Transforms of the Wigner distribution function: characteristic functions. 
Depending on the problem, one may use the Fourier-transform of the 
Wigner-function (329) instead of (329). This new function may be called the 
characteristic function (in analogy to classical statistics). It is given by 


Xw(B1, B2) = tr{exp(iB, x+iB2 p)p} (342) 
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(apart from a constant factor), Similarly, the characteristic functions 
belonging to (338) are 


xr(B, B*) = tr{exp(ip*b') exp(ipb) p} (343a) 
x0(B, B*) = tr{exp(iBb) exp(iB*b") p} (343b) 


Note, that these latter functions are Fourier transforms of (338) but not of 
(332), because of the factor exp(+|f|?/2) (see (337)). 


b. For electrons 


(i) Distribution functions for a single electron. Schmid and Risken®* have 
defined the distribution function 


Sx) = nf ae .{exp| -7 So én tr{exp ly ex a'a,| plate) (344) 


with 


= * 
Vin = UR. 


Here the v’s are classical variables associated with electron operators and N’ 
is a normalization constant, so that 


[--[roee = 1, 


According to Haken et al.” another possibility is given by 


S(O) = nf es .{ exe( ate Vix én) tr( [exp En aja) 


(345) 
2 II exp(it, a,'a,) [] exp(itj, a; a,) d{g} 


j>k 

which has great advantages in practical calculations (see Section 4.10). When 
an equation of the Fokker—Planck type for the /’s is derived in the case of a 
single atom it turns out that the f’s are singular and that diffusion coefficients 
of high order are needed. In the actual laser one has to deal, however, with a 
great number of equivalent atoms. It is then possible to introduce macroscopic 
variables which show only small fluctuations, so that only the first terms of the 
generalized Fokker—Planck equation are important (see below). For a system 
of two-level atoms these macroscopic variables are defined as follows: 
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Macroscopic classical variables quantum mechanical operators 
total dipole moment 
v* <—-—______> S* =) (a,fa,), 
Lu 
v -——____ > S” =) (a,' a), (346) 
BR 


total inversion 
D OO 28S, = Y (a2! az —a,* ay), 
a 


The above correspondence is established by the distribution function 


f(v, v*, D) = N’ | ahs i exp(—if{vé + v*E* + D/2}) x(E, &, 0) dE dC. 
(347) 
where 


x = tr{exp(ig*S*) exp(ilS,) exp(iES~) p}. (348) 


A trivial generalization consists in replacing ¢S, by €,N,+¢&,N, where 
N,, N, are the total occupation numbers of levels 1 and 2. 


(ii) Characteristic functions. These functions are defined by 


xE) = tr{exp(# bn a'a,) P| 
J 
or (349) 
un) = t{ [Texel aj'as) [Tex (its a) [Lexp(itie at a) o| 


(iii) Electrons and fields. In this case it is possible to take any combination of 
the definition of (i) and (ii). In the following we shall consider the example 


I (u, u*, v, v*, D) = nf. ; .{ exp| -i(og-+0* Ee 4 Cr tup+utp*) | 
x (6, &*, C, B, B*) d7é dl d?B, (350) 


where 


x = tr{Oc’, &*, ¢, B, B*) p(t)} 


O = exp(it* S*) exp(il S,) exp(iES~) exp(if* bt) exp(ifb). (351) 


The characteristic function is obviously y. 

To find the equation of motion for the distribution function of the field 
it makes little difference whether we use the Heisenberg or the Schrédinger 
representation. In the Heisenberg representation the operators are time 


and 
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dependent, whereas the density matrix is considered as time independent. On 
the other hand, in the Schrédinger representation, the operators are time 
independent, whereas the density matrix is time dependent. For the atoms it 
turns out, however, that the Heisenberg picture leads to very complicated 
expressions. So far, nobody has been able to derive a completely exact equation 
for the distribution function in that case (see e.g. Lax and Yuen“**). On the 
other hand, the treatment of Haken et al.,“°“) who used the Schrédinger 
representation and a product of exponential operators, has led to a completely 
exact equation for two-level atoms. We first give a sketch of how to derive 
this exact equation and then we present the results for the arbitrary dissipa- 
tive quantum system which comprises as special cases an N-level atom (see 
Section 5). 


4.10 GENERALIZED FOKKER—PLANCK EQUATION 
Exact equation for the distribution function for a set of two level atoms» + 


We differentiate both sides of (350) with respect to the time. Because y 
depends on the time only, via p (see (351), we have 


a= lo. (352) 


When we introduce for dp/dt, the right-hand side of the density matrix, we 
obtain expressions of the form 


tr(Oa,' a, p), tr(Oa,t a, pa,'a,), tr(Obpb*), etc. (353) 
If a,'a, and b were not operators, we could obtain, for example, the expression 
tr(O(a,' a, —a,' a,)p), (354) 


by differentiating O with respect to € which occurs as a factor of (a,t a, — 
a,' a,)in the exponential. Because the a,t a, are operators, the procedure is not 
quite so simple, because by commuting a pair of a’s with another pair of a’s 
usually new pairs are generated. However, these pairs form an algebraic ring 
which is finite. Therefore, it is, in fact, possible to replace all suitable express- 
ions of the form (353) by derivatives of O with respect to classical c-numbers. 
Therefore all quantum mechanical procedures can be replaced by c-number 
procedures with respect to the characteristic function. When the Fourier 
transform of the characteristic function is taken, all procedures with respect to 
€, €*, etc. are transformed into those with respect to u, u*, v, v*, D. Thus we 
obtain a differential equation for f. We give some details of this calculation in 
the following. 


{ The reader who is not familiar with the derivation of a classical Fokker-Planck equation 
is advised to read 4.12c, 
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Because the density matrix equation (317) consists of three parts referring 
to the electrons (atoms), the fields, and the interaction between electrons and 
field, we decompose dy/dt correspondingly: 


a . (0(3r),} (356) 
(2). a tr{o* [Haz pi}, | (357) 


where the sum runs over all atoms, yz. Considering a single atom, we have 
tr{O(a',a, —a",a,) p}. 
We specialize O to a single atom by replacing 
S,*,S8,7,S, by a,ta,,a;fa,; (a,4a,—a,' a,)/2 
respectively and consider 


exp(ié* a,‘ a,) exp[if3(a,' a, —a,* a,)] exp(iéa,* a,)(a.' a, —a,! a,). 
(358) 


We shift the last bracket through the last exponential function and calculate 
the commutator: 


exp(ita,' a,)(a,* a, —a,' a,)—(azt a, —a,' a,) exp(ita,t az). — (358a) 
We first expand 
exp(iga,' a2) = 1+i€a,' a, +4(i€a,' a,)? +... 
which reduces on account of (225) and (224), to 
exp(ifa,' a,) = +1ita,' a, 
Inserting this expression into (358a) and using again (225), we obtain 
(358a) = 2i€a,* ap, 
which may be written in the form 


2ita,* a, exp(ita,' a,). 
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Thus (358) = 
exp(ic* a,' ay) exp[i3(a,' a2 —a,' a;)K((a.' az— a,' a,)+ 2ita,* a>) 
x exp(iéa,' a>) 


a0 00 
Date ae (359) 
Using (351) and (359) we find 
tr(O(a,t a,—a,ta ) p) = 2k 4 26%. (360) 


When we now take the Fourier transform of the right-hand side of equation 
(360) we have 


0 
[ expc—ito.. Je ae iz, = 15 | exp(— igo) ag. S44 


and, by partial integration 


15 {—[aSexo(—ito)...48...| 


- = (of xexp(—iko...) dé...) = Sof. G61) 


ll 


Il 


Consider as a further example, the expression 


exp(iB* bt) exp(ipb) btb. (362) 
We shift bt through exp(ipb) 
exp(iBb) bt — bt exp(iBb) = if exp(ifb). (363) 
Thus (362) gives 
exp(if* bt)(bt + iB) exp(iBb) b (364) 
= a eee iR* Bt ; 
~ (sea aq) e808" 2D exe se 
and therefore g2 a 
i) en See eee ede 
(00) = (Saat aap) Sa 


Taking the Fourier transform of (366) we can make similar transformations 
as above. 
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We hope that the above considerations elucidate the spirit of the deriva- 
tion, so that we proceed to give the final equation for /: 


df/dt = Lf (367) 
where 
L=Lythyt+Lly 


and 
2 


Wi a 6 oe r) 
L,= 5 [s jexo(-255)—1| +Nexp(2 =) Feta? +2N ae abe 


0 ia 0 
a * 
+a =| scar tex xp(- 25) 1] (368) 
a d at D 
-2| |exe( - 255) 7 1 ~exp(255) 0,2 0,* =| | 
Was 2) 1] 42042 ortzfenn(22)-1] | 
+ 5) (N |exe(255) 1] +505 +2) exp 2 1 rae 


0 a? 0 
—. pF gs * 
Lat ia{ [exo( - -27) o— a5 Saet 5 D|u 


0 o a? 
L,= ls Wau ‘| + 2K, Te (370) 


4.11. THE CALCULATION OF MULTI-TIME CORRELATION FUNCTIONS 
BY MEANS OF THE DISTRIBUTION FUNCTION” 


So far we have shown that we can derive a well defined equation for the 
distribution function (347). We have now to show how the properties of the 


THE SEMICLASSICAL AND QUANTUM THEORY OF THE LASER 279 


laser can be calculated by means of this function f. The laser properties, in 
particular its coherence properties, are determined by correlation functions of 
the form 


<bY(t,’)... bY(ty’) b(t,) ... B(t1)). (371) 


In (4.8) we have shown that (371) can be evaluated by the single atom density 
matrix. Because fis intimately connected with p, we start with the form (321). 

The solution of eqn (324) defines p(a, y), where the last term is explicitly 
given by (317). We define a generalized distribution function by 


f (u, u*, v, v*, D; a(t), y(t); t) 
=n’. . .[ exp(— int —wre iS —iuB —iu* B*) 
X(é, *, C, B, B*; a(t), y(t)) d7é de dB, (372) 


where 


x = tr {exp(ig*S*) exp(ifS_) exp(iES~) exp(iB*b*) exp(iBb) p(a, y; t)}. 
(373 


Evidently the trace over p(«, y) may be expressed by 
tr{p(a, 7,1} = { f(u ut, v, 0%, D, 0,7, 1)@udvdD. (374) 


All steps which were done for the derivation of an equation for p may be 
simply repeated. We then find 


af —Lf = —ia(t)uf —iy*(t)u*f. (375) 


We again introduce the decomposition (325) so that (375) becomes 


df = 
a, LS = 0 (376) 


for t 4 t, and additional jump conditions 
S(,+90) —f(,-0) = —ia,uf(t;—0)+corrections of order «?. (377) 
We define a Green’s function by 


dP 
“ar —LP = 6(t—t,)d(v—v,) d(u—u,) 6(D— Dy) (378) 
and use the notation 
P(up, u,*, v2, v,*, D, > U4, u,*, V4, v,*, D, > to, ty) — P(2, 1) 
PU, 0) = f(a = 0;y = 0). (379) 
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According to the jumps at time ¢,, f,,7¢,;... we decompose the solution of 
eqn (376) into these time intervals, where we find for 


t<t, f= P(1,0), (380) 


where P(1,0) is the stationary state solution of eqn (376). For the next time 
interval we find 


f= P(, o+{ (—ia, uy +iy,* u,*) PQ, 1) PC, 0) aV,, (381) 


where 
dV, = du, du,* dv, dv,* dD,, (382) 


and for a time ft after the last jump at ¢,, we obtain the expression 


f=P(1,0+¥, foe f [Tot 1s aig uy bing u,*)} P(1,0) dV;...dV,. 
v=1 H=1 


y (383) 


Because we must differentiate f finally with respect to all «’s and y*’s respect- 
ively, we need only retain the term with v = n, so that we need only consider 
the equation 


{rar -{ I (—ia, u,-+iy,* u,*) P(u w—1)dV,...dV,, (384) 

no dnd n= 

where we have integrated both sides over V,,, = V and have used the relation 
[Pott.m deer = |, (385) 

When we differentiate with respect to « and y* and make, if necessary, the 


transition to equal times (e.g. t; = ¢;,,) we find our final result in the following 
form 


<(B*(t1))" (4 (02)... (OT))”" (Bt) )"*- - (BC) 


n 
= i ‘ | ae Gay a I P(u, u—1)dV,...dV,. (386) 
n R= 


Note that (386) is the most general formula, because by putting some of the 
’s or v’s equal to zero, we may obtain all desired expectation values provided 
the operators are time ordered and in normal order. 
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4.12. REDUCTION OF THE GENERALIZED FOKKER-PLANCK EQUATION 
FOR TWO-LEVEL ATOMS AND FOR SINGLE-MODE ACTION 


In this section we show how the exact generalized Fokker-Planck equation 
can be reduced step by step. In particular the adiabatic approximation is used 
which assumes that the relaxation times of the atomic system are much shorter 
than all other relevant times. This approximation holds to an excellent degree 
in the threshold region and above, up to rather high photon numbers. The 
resulting Fokker—Planck equation is valid for solid state, gas and semi- 
conductor lasers. It is solved in Section 4.13, for the steady state. In 4.14 we 
shall represent the Fokker-Planck equation for multimode action near 
threshold and give its (nearly exact) stationary solution. 


a. The first reduction: expansion in terms of powers of N~* (N = number of 
atoms) ®>) 


We introduce units 
u Dv os D 
i= j= : D= : (387 
VMnr Vine Deny ) 
where 
Nene = 5 N* (388) 
6, = 401 9) /K(e+y,))*, (389) 
27, = Wy2tWa1, Yq = Wy2t Way, (390) 
Vr” = K? My, /g? = 52 N*?/K, (391) 
K = @’N/xy, = 1/dy,,5 (392) 
6. = 4(Ky/y(e+9,))*; (393) 
and 
Du, = N/K. (394) 


We consider K as fixed, but N to be very large. In the present context, ny, 
Vine’ and D,,, merely serve as abbreviations of the corresponding right-hand 
sides of eqns (388), (391) and (394). Later it will be shown, however, that 
these quantities are just the values of the photon number, of the absolute 
square of the total atomic dipole moment, and of the total inversion at 
threshold. The threshold inversion of the single atom is d,,. Taking into 
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account terms of the order one and N~?, we find 
miaK 
6, éd0b* 
(395) 


0 nea © ae a 
Lg = (W21-W12) Keg tn(s Ot Ae a) + yy DN 3 


ee 
Lyu-p = —ig| 25 8 85,82 K) 


1 1 
(JK JK 


Since g ~ N~? for fixed K, (392), we have neglected all terms in the curly 
bracket which are proportional to N~*, N~*,.... In order to obtain a concise 
formula, we replace the complex variables u and v by two-dimensional vectors 
thus 


a. a 
+52 Dil /(1/62) Nt = 8 V(6al61) Nt—c..} (396) 


(u,u*) > u —i(v, v*) 0. 
Also, we put _ 
Do = Niv12 Wat (397) 
Wi2tWo 


After a short transformation we obtain 


é é 
“Sev (Kut aw) f}+A{(—2 v+gDu)f} +1 (Do —D) —4guv] f} 


2 


K 0 
= Fa bef HEN W12 Mf t7y NE (398) 


An equation of this form was first obtained by Risken et al.4* There is only a 
slight difference between their equation and eqn (398). They used symmetrized 
diffusion coefficients and found n,,,+4 instead of n,,, and 4N(w,,+w,,)/2 
instead of w,, in the first two terms. A detailed analysis shows that both forms 
lead to the same results, as long as higher derivatives are neglected. 


b. Adiabatic elimination of the atomic variables 


With respect to the statistical properties of the laser field, the distribution 
function fcontains more information than is needed, because it also contains 
the atomic variables. Thus it is desirable to eliminate these variables, so that a 
function W(u, u*) of the field variables alone can be determined. If the re- 
laxation times of the atomic system are short compared to all other times of 
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the systems, the atomic variables (i.e. the dipole moment and the inversion) 
follow the motion of the field adiabatically. Under this assumption the atomic 
variable can be easily eliminated and, to do this, two ways are known: 


(1) direct elimination within the Fokker—Planck equation ;©® 


(2) elimination via the Langevin equations.©” 


We present the second procedure because it is easier to follow. 

The Fokker—Planck equation (398) possesses the following associated 
Langevin equations for the complex light amplitude u, the total dipole moment 
v, and the inversion D: 


d 
pk iqvu = 399 
(<,+*) utigv=T,, (399) 
d ; 
(S+n) v~—iguD = il ,, (400) 
d ; 
($+) D—D.)+2i9(otu— vu =I: (401) 


Note that these equations are now classical with classical random forces. The 
diffusion constants are given by 


J T f(T 
Q= se) | ied Fe dr dee (402) 


A comparison with the RSW-Fokker-Planck equation” (in the symmetrized 
form) yields 


Qu =F (nt Ds (403) 


Wai tWi2 


Q,=4Ny, =4(Ni+N2) 71, = 4N (404) 


Had we used the Fokker—Planck equation (398) instead, the result would read 
Q,, = KMy/ 2, (405) 
OQ, = Nw,,/4. (406) 


We shall see below that the results are not affected by the difference between 
(403), (404) and (405), (406) so that the corresponding Fokker—Planck 
equations are equivalent. We now eliminate the atomic variables. In accord- 
ance with the adiabatic assumption we neglect d/d¢ compared to y, and »,. 
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Neglecting I, too, we obtain 
4g? 


D=D5-— 
° Yue 


Do uu*, (407) 


Inserting (407) into (400) and again neglecting d/d¢t we obtain the dipole 
moment as a function of the field alone. Introducing the corresponding ex- 
pression into the field equation (399) we find 


<u P—wru)u =F, (408) 
where Ky 
| 409 
Yi +YDN2—N ir one 
ss Y 
fi = d = —— (Do~Dinx)s (410) 
Dir= (N2 —Ny)inr = Ky, /97, (411) 
2 

re—*p4— er, (412) 

K+Y, K+Y, 


c. Classical derivation of Fokker—Planck equationt 


In the foregoing chapters we have shown how the Fokker—Planck equation 
for a laser near threshold can be derived step by step, starting from the 
quantum mechanical density matrix equations. The resulting equation can 
also be obtained in a more heuristic way by interpreting the quantum mech- 
anical van der Pol equation (267) as a classical equation. We briefly remind 
the reader by means of an example how the Fokker—Planck equation is 
derived for a Markoff-process. Consider the equation of motion of a classical 
Brownian particle, which is moving e.g. in a liquid or gas. Its velocity y 
changes due to a friction force — yy and a randomly fluctuating (Langevin) 
force F(t). We assume that the ensemble average over F(t) vanishes: <F(t)), 
and that the reservoir i.e. the surrounding gas causing the fluctuations of 
F(t), has a very short memory, so that 


CF(t) F(t’)> ~ 6-1’) (413) 


+ This equation is slightly more general than eqn (267) in that we have now x+y, instead 
of y1 (=). 

{ Readers who are familiar with the derivation of a classical Fokker—Planck equation may 
proceed directly to 4.12d. 
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We now want to derive an equation as to what the probability is of finding the 
particle with a velocity x at a given time. For each realization of the motion, 
i.e. for a given sequence of pushes described by F, we obtain a definite “path”. 
The probability of finding the particle of a velocity y is then 

P(x, t) = 5(x—x1(0)). (414) 


If the “‘gedanken” experiment is repeated, we would find another path 
realized > 


P(x, t) = 6(y—-x2(t)), 
P(x, t) = 6(y—x3(t)). (415) 


When the experiment is repeated often, we can define an average value of P by 


» #5(x— x(t) 
I(% 1) = PW t)> = a aa <i(x—x(2))>, (416) 


i 


where w, is the frequency (or mathematically speaking, the probability) of 
realization of the special path i. In order to derive an equation for f we form 


Af t) a f(y, t+ At) Sf t) 


= ((x—x(t+ At))>—<d(x-x())>. (417) 
Using 


x(t+ At) = x(t) + Ax(t), (418) 
we expand the right-hand side of eqn (417): 


Af (x,t) = ((-5-x0)) x) +4( 0G n(N(Ax(0) ). (419) 


We now use the Langevin equation which describes the motion of the 
Brownian particle: 


x(t) = ~x() + FO), (420) 
from which we find 
Ay = —yyAt+ AF, (421) 


where At is so small that y has changed only little, but so large that AF 
contains many pushes. We use <AF> = Oand find further on account of (413): 


<AF(t) AF(t)> = 2Q. At (422) 


i.e. a proportionality to Az. 
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We now treat 
d 
(s 0(x—x(t)) ax()) (423) 
Because x does not depend on the averaging procedures, we may write 
d 
(423) = yo —x(t)) Ax(t)>. (424) 
Replacing Ay(¢) according to (421) we find 
d 
(424) = by OX MOAD) At(—y)+ AF(t))>. (425) 


Due to the 6-function we may replace y(t) by x; due to the Markoff assumption 
we may split the term containing AF(t) into a product of averages, so that we 
have finally: 


(58-200) x) = £(54—-O)( Atyy)> + 6(%- x) DK AF(D))}; 
x x (426) 


where <AF(t)> = 0. 


We treat the second term in (419) similarly. Keeping terms ~ At and dropping 
those ~ (Ar)?, we find 


d2 d? 
(a 8¢1—n(0 NA) a Gee HD) -2Q. At. (427) 


Inserting (425) and (427) into (419), dividing the resulting equation by At and 
letting 


Af df 


ato At df’ 
we obtain the Fokker—Planck equation 


af 


d d2 
ao dy 6 HU +O Fal t). (428) 


On account of the Markoff assumption, no higher derivatives occur. The 
derivation goes through the same way if the friction force —yy is replaced by a 
nonlinear force K(x), e.g. of the form 


K = +Bxy(d—-y’). (429) 
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The derivation is easily extended to e.g. three dimensions or more variables. 
Let us return to the laser, described by the Langevin equation (408) We 
decompose the complex amplitude b*: 


b* = rexp(id) = x+iy (430) 
and replace f by W(y,y) or W(r,@). The Fokker—Planck equation for a 
particle moving in two dimensions under a friction force 
a A — (v2 2 
K, = +By(d—(? +y”)) 3 
is given in analogy to (428): 


df 


eee dowd ee 
aoe gba ty DF = qd Sry DT} +0(F ‘ a) : 


(432) 


d. The Fokker-Planck equation 


We write the complex field amplitude u (= b*), in polar coordinates: 
u = rexp(id). The Fokker—Planck equation belonging to the classical van der 
Pol equation (in the rotating wave approximation (408) or (267)) reads‘°®) 


ow ~ = (r=) 1 ew 


1 @d 
ath, A-IPW) = OLE (a) tage} 9 


The diffusion constant @ is given by 
1 T f(T 
o=a5[ | reread at (434) 


Using the form (412) of T’ and the fact that [., and [, are uncorrelated, we 
obtain 


o= (2 )'0+—o 5 0. (435) 
K+¥i (K+y1) 
Inserting (403) and (404) into (435) we find 
¥. \?K 1 g 1 
e = a Nis, 4 
O= (2) Fmt +e inn (436) 


We now eliminate the coupling constant using the threshold condition 


(N2—Naene = KVi/9 e (437) 
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After re-arranging (436) we obtain, finally, 


o-+| tae Msrurtcesand 


2 K+, (N2—-N Dene 
(438) 
4 N 
= x Yo | | nat 2thr | 
2 LK+y, (N2—N yer 
(Note that N = N, +N, = (N, +Ny)tnr)- 
Let us now discuss the diffusion coefficient using (405) and (406): 
m1 \*k Ky 1 
o-( ) Sn + ——; ————- Nv, 439 
etn) 2" Geta)? Wa=Niad oe 


where use of (437) has been made again. Because N, = (wW,2/(w,2+W2,)) N at 
threshold, (439) and (438) are indeed equivalent, provided that the photon 
numbers are not too large. 


4.13. SOLUTION OF THE REDUCED FOKKER—PLANCK EQUATION 


a. Steady state solution 
First we introduce normalized coordinates: 


P= Y(B/Q)r; t= J/(8Q)t; d= V(Q/A)a. (440) 
Using these, the Fokker—Planck equation then takes the form 
1 a ( ‘Z) 1 ew 


Fo op) tage 


ow 


1 a ee _ 


(441) 


Its stationary solution is given by® 


Pr 


N ad a2 1 re) pt p2 
Wr) = sex] — Fea], w-{, perp] - 7 +a] dP. (442) 


The distribution function (442) is a continuous function of ? (or r). In order to 
compare it with the distribution, p(n, T), of the discrete photon numbers, n, 
which are measured in the time-interval T; a correction derived by Mandel 
must be added. If T is small compared to the relaxation times of the van der 
Pol oscillator, p(n, T) is given by 


ss exp(— al) W(2) df, (443) 


n 


P(n, T) = iN 
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wheret 
P=?7?,  &@=a,/(Q/f).T. 


On account of W(1) = 2W(f) and using the explicit form of W(?) (442) we 
find 


_ &" F,(a—28) 
where F,,(a) is given by 
% 72 
F(a) -{ I" exp| = +a z di. (445) 
0 4 2 


For a large average number of photons, i.e. for large @ we obtain a (nearly) 
continuous distribution 


P(n, T) = Fay ?{- (4) + sar. (446) 


b. Non-stationary solutions 


In order to obtain correlation functions, the non-stationary solutions of the 
Fokker—Planck equations must be used. Since general analytical expressions 
are not known, approximations must be used or computer solutions must be 
sought. There are some exact relations, however. If we denote the Green’s 
function belonging to (441) by 


FP, 3%, 6’, 2) 


the correlation function of the intensity fluctuations is given by” 


K(a, t) = ((P?°(P +4) — <P) (PD) — <P>) 
7 i { | | Pd PY dP’ de d'(P2—<P2))(P'2— CP"2)) F(P, Os Pb’, Nes 
Expressing F by its decomposition into eigenfunctions, one finds 
K(a,*) = K(a,0) DH Mp €xp(—Aomé)- (448) 


The M,,’s are certain moments of ? with respect to the eigenfunctions, 1,,’s are 
the corresponding eigenvalues. 

The spectrum S(a, ) of the intensity fluctuations is given by the Fourier 
transform of the correlation function (448)t 
} «is a proportionality factor which connects the probability of measuring a single photon 
in the infinitesimal time interval dt with intensity J = P(1, dt) = of dz. 
{ The use of & instead of @ indicates that we are using reduced units. 
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dom 


"BS (449) 


S(a,0) = K(a,0) ¥. M 
1 


Although it is a sum of Lorentzian lines of widths Ao,,, the spectrum S(a, @) 
may be well approximated by an “effective” Lorentzian curve? 


eg f 


Ss ®) = K(a, 0)———— 
ett (4, ) (a Fn 


1 » M (450) 
with = a 


which has the same area and the same maximum value. This effective width 
Ate 18, however, about 25% larger than 4), for a ~ 5. The eigenvalues and 
matrix elements were calculated numerically by Risken and Vollmer” for 
the threshold region —10 < a < 10. The results of Risken and Vollmer were 
experimentally checked by a number of authors and excellent agreement was 
found (see the chapter by E. R. Pike in this volume, where the relevant 
references are given). 


4.14. THE FOKKER—PLANCK EQUATION FOR MULTIMODE ACTION NEAR THRESHOLD 
a. Stationary solution 


In Section 4.10 we have derived an exact generalized Fokker—Planck 
equation for a single mode and a system of two-level atoms. In a region not 
too far above threshold we have reduced that equation to an ordinary Fokker— 
Planck equation for the field variables alone. The whole procedure can be 
extended to multimode laser action with multi-level atoms. We do not intend 
to repeat all the steps in detail, because the equations are very lengthy. We 
rather want to describe how to derive this final equation be merely looking at 
some of the intermediate results of our single-mode analysis. 

The first important result was that the Fokker—Planck equation, after the 
first reduction, can be interpreted as being connected with a set of classical 
Langevin equations. If we remember the results of Section 3 on the semi- 
classical treatment of the laser equations, we find by comparison, that the 
classical Langevin equations coincide with the semi-classical laser equations 
(if the fluctuating forces are dropped). Because the drift coefficients are 
identical to the right-hand sides of these Langevin equations (without 
fluctuating forces), we can immediately use the corresponding expressions of 
the semiclassical analysis. Furthermore, we have seen that the subsequent 
adiabatic elimination can best be done in the framework of the Langevin 
equations. This elimination procedure has been performed in Section 3 for 
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multimode action, so that we can use the results of this chapter for the explicit 
form of the drift coefficients. Finally it is obvious from our above analysis that 
the diffusion coefficients can be found from fluctuating forces of the quantum 
mechanical Langevin equations. It can be shown that the diffusion coefficients 
Q;; vanish for different modes, i.j. The proof of this simply uses the fact that 
the thermal heatbaths of the modes are uncorrelated and that the modes are 
orthogonal with respect to their spatial dependence. 

With these considerations, it is quite simple to write down the Fokker— 
Planck equation for multimode laser action in homogeneously or inhomo- 
geneously broadened lasers. Denoting the complex field amplitudes of the 
modes A by u,, u,*, we obtain 


d r) 0 
di W (uy, ...,Uy) = = 25, Aa veey Uy) — days Aw veep Uy) 
i G2 
+ F%r—s| W (uy, --+> Ung)s (451) 


where 


A,* (uy, ...5 = (i@,— *_y* \ d = 
at (uy Uy) = (i@,—-K,) uy a » onl G nal Q,—v,—i, 


* * *y * 
42 DY dou Gua® Guar* Gnas Gurs Ua, ag U,, 
BALA2A3 


x (> +i(Q,,- 2,)) (Q,,+2,,—-2,,-,— no} ~* (452) 


‘ O,,—- 94,4 2i7, 
(Q,,—v,~—i7)(Q,,—-v, tip.) ; 


In these equations w, and x, are mode frequency and halfwidth in the un- 
loaded cavity; Q, is the mode frequency in the loaded cavity; dp, is the un- 
saturated inversion of atom y; g,, is the coupling constant; v, is the frequency 
at line centre of atom py; y, is the homogeneous atomic width; and T is the 
longitudinal relaxation time. Further, we have for the diffusion coefficients, 
1 2y,(N 2 +N wu 2 
= — _—_ +2k 2n +1 ° 453 
Orme [Date ae Mn+ 2g +1) (453) 
Before giving the (nearly exact) solution of (451) with (452) and (453) we 
present a theorem. 
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b. Theorem on the exact stationary solution of a Fokker-Planck equation” » 


We assume that the drift and diffusion coefficients of the Fokker—Planck 
equation 


ow 0 r) ? 
Ow fy(2e <6) aaa} 454 
ot 3 (a oa Hs 2, On, Ou, du ;* ee 
have the following form 
OB 
= Fe tT), (455) 
OB 
ee (2) 
C; ou toe (456) 
Q1; = 94; Q. » (457) 
We assume also that 
OB oB 
ey (5 F Ot a7) = 0, (458) 
: J J 
Jj 
OF) aT > 
= 459 
| Ou; " a) —e 


The theorem states that the stationary solution of the Fokker~Planck 
equation (454) is given by 


W = Nexp(—2B/Q), (460) 


where N is the normalization constant. It is implicitly assumed that W is 
normalizable which is always the case in laser systems because of the gain 
saturation. 


c. Nearly exact solution of (451) 


Because modes near the atomic line centre have the highest gain, we con- 
fine our analysis to modes for which Q, is nearly independent of A, so that the 
condition (457) is fulfilled. Whether A* and A (452) fulfil the condition (455), 
(456), (458) and (459), requires a detailed and lengthy analysis, so that we 
represent only the results for the important example of normal multimode 
action. 

We assume that the frequency spacing AQ = Q,, — Q,, is large compared 
to 1/T. The important contributions to (452) then stem from those terms in the 
sum where A, = A,. If we further neglect combination tones, the sum over A, 
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reduces to a single term: A, = A. Under these assumptions eqn (451) possesses 
the following exact solution, 


2 
W= Nexp| -9 (BO+B4B}, (461) 
where 
BO) = py K,U,* u,, (462) 
is connected with the cavity losses, 
B® = ¥ AK, u,* u, (463) 
with 
YL 
Ax, =-—) 4, 2 464 
Ky py onl nal (Q,—v,)? +7? ( ) 
is connected with the unsaturated gain, and 
1 
B® = 2y.7 TY) [eel leayl? ould ual? l9 ya)? ——>g—— -» ar 
aa pu A At On Gua HA, (Q,-v,)" +4, (Q,, —v,) +y,? 


(465) 


is connected with the gain saturation (including mode interaction). (If (461) 
with (462)—(465) is specialized to a single mode, the result (442) follows.) 


5. THE GENERAL THEORY OF DISSIPATION AND FLUCTUATIONS 
OF QUANTUM SYSTEMS FAR FROM THERMAL EQUILIBRIUM 


5.1. THE QUANTUM SYSTEM 


We adopt the method of projection operators. We assume that a system can 
be described by a set of quantum states which are distinguished by a quantum 
number /. In the following we always assume that / stands for the complete 
set which characterizes the quantum states of the system and we shall only use 
the property that the projection operators P,, form an algebraic ring with the 
property 

Pix Pim = Pim Oia- (466) 


By means of these projection operators it is possible to decompose every 


operator Q into 
Q = VY HOM Pin (467) 
im 


For aN level atom with a single electron, P,,, can be written in terms of creation 
and annihilation operators, thus P,,, = @,'d,- 
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5.2. THE GENERAL QUANTUM MECHANICAL LANGEVIN EQUATIONS?) 


We assume that transition rates w and damping constants y are given, either 
by a calculation from first principles or, what is of great practical importance, 
phenomenologically. We require further that the fluctuating forces have two 
properties: 


1. they have a short memory, so that their correlation function is propor- 
tional to a 6-function of the time difference (or, in other words, they have 
Markoff character). 


2. they are determined in such a way, that the relation for projection opera- 
tors holds, at least if the average is taken 


<P iz P. in = ba P. tes (468) 


where the P’s are the time-dependent operators under the influence of the full 
Heisenberg operator, which comprises also the effect of the heatbaths. We 
show that the fluctuating forces can then be determined uniquely and ex- 
plicitly by a rather simple formula (see (486) and (490)). They secure also the 
quantum mechanical consistency of the equations of motion for the operators 
P;,. We start with the averaged equations which we write in the general form 


d 
dt Pi i> = Pp» Mi, 1, shado Pay de 
1j2 


The number of levels 1, 2, ... may be arbitrary. Denoting the vector formed of 
P;,;, by A and the matrix formed of M;,;,,;,;, by M (note that i, i, counts as 
a single index) we can represent eqn (469) as 


> (469) 


d 
ay <4? = MAD. (470) 


In it, M contains the transition probabilities (especially the losses), the non- 
diagonal phase-destroying elements produced by the heatbath, and the 
“coherent” driving field. Because these driving fields, if treated quantum 
mechanically, may still depend in an implicit way on the heatbath, the average 
over the heatbath has to comprise both M and A. We now go one step back 
and consider the motion of the unaveraged operators A by adding fluctuating 
driving forces, T. 


A= MA+YF. (471) 


The driving forces can have any form and can especially still depend on P;, 
for instance, in the following form 


I(t) = L(@#)A+N(t), (472) 
where L and N do not depend on A. 


THE SEMICLASSICAL AND QUANTUM THEORY OF THE LASER 295 
In order to come back from (471) to (470) we must assume 
<T> =0. (473) 


It should be noted that we have to impose as an additional condition, the 
conservation of the total probability, expressed by 


N 
y Pa = 1. (474) 
i=1 
The formal solution of (471) can be written as an integral representation 


A= i K(t, DT (0) dt-+A,, (475) 


where A, is a solution of the homogeneous part of (471). We postulate in the 
well known way that the kernel has the property 


. K(t,t) = E. (476) 
The initial condition requires 
Aj,(0) = Piz (477) 


where the P;, are the Schrédinger operators. We now construct 


<ABA) = (( { TO K(t, 2) de+4,) B(| K¢ v’) T(t’) ac'+4,) ) (478) 


where ~ denotes the adjoint matrix. It is assumed that B is a constant matrix 


B= (By) (479) 
having only one nonvanishing element (i, /) = (i;, i23 J; J2): 
Bas, kas taste = Obes, i Oka, in Oty, js Oty, joe (480) 


Using the property (468) we find for the left-hand side of eqn (478) by dif- 
ferentiation 


d 


Sint ‘dt 


Pi, j2.> = 91, ad Mi, j23m, 2 Pmn>- (481) 
By differentiation of the right-hand side of eqn (478) and using the properties 
of (475) we obtain the following terms 

<F()BA)D; ;. (482) 
<ABI(t)>, p (483) 
<AMBA), ;, (484) 
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<ABM A), ;. (485) 


For the evaluation of expressions (482), (483) we use the Markoffian 
property, i.e. 


T1210 5,500 = Gi, iz i, 5, (6-09 (486) 
and 
&T;,,:,(2)> = 0. (487) 


We now have to perform the average over the heatbath where we know 
that the fluctuations have only a very short memory. We assume that the 
response of the particle (or spin) is slow enough so that K and A, can be taken 
out of the average (Note that M contains, at most, the heatbath coordinates 
indirectly over the quantum mechanical (‘‘coherent”’) fields, but in all loss 
terms and phase-memory-destroying-parts they do not appear). Equation 
(482) then gives the contribution 


G, ,/2 (488) 


and the same quantity similarly follows from (483). In order to evaluate (484) 
we contract the product of the A’s by means of the rule (468) where we make 
use of the commutativity of the operators I, with the operators of the driving 
fields within M, which holds in the whole Heisenberg picture. We thus obtain 


(5 Poa fi m,n,i1,i2 Pan) = > <P. mj2 Mi,, i2; mii? (489) 


from eqn (484) and a similar one from eqn (485). We now compare the results 
of both sides, thus obtaining the following equation for the G’s which occur 
in (482), (483) 


Gin; Juda = <{61,,;,M. 17 t1,J2; mn —4,,j j2 Mit; mii —Omi; M,,, jai AD LG See (490) 


This equation represents the required results. It allows us to calculate all 
correlation functions of the type (486) if the coefficients M are given, and the 
solutions P;, of the averaged equations are known. A closer inspection} shows 
that all terms containing external fields cancel each other completely on the 
right-hand side of eqn (490), so that only the transition rates and damping 
constants need to be used for the M’s. In the following we therefore drop the 
field terms. 

In the case of the laser, the incoherent part of the atomic equations often 
reads: 


} W. Weidlich, private communication. 
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d 
dt (a;'a;) = » Wij ay'a,— J, Wie a;'a;+ I, ,(t). (491) 


d , 
a (a;'a,) = — Yj4(ajtay) + Y;,(t) (j # k). (492) 
By specializing formula (490) we obtain 


Gis, iim 5y| x, Wi a,'a,> +) Wi<a;'a;> s W,;<a;'a;> = W,:<a;'a;), 
k ke 
(493) 


Gi, = py WryiXa,'a,> — de WiXa;'a;> + y+ Wi<a;'a>, iF j. (495) 


G 


For many applications (e.g. laser theory) a knowledge of the second moments 
of the fluctuating forces is completely sufficient, because the fields interact 
with many independent atoms. Thus the results (e.g. on the linewidth) depend 
only on a sum over very many independent fluctuating forces which possess a 
Gaussian distribution (the fluctuating forces of a single atom are not Gaussian, 
however). 


5.3 THE GENERAL FORM OF THE NON-MARKOFFIAN DENSITY MATRIX EQUATION 


We now consider the most general form of a density matrix. We assume that 
the system considered is subject to external forces, for instance, external 
electric and magnetic fields, and that it is coupled to external reservoirs (heat- 
baths). Furthermore we admit that the system itself has an internal interaction, 
for example, in the case of spins, the spin-spin interaction or in the case of 
electrons, the electron-electron interaction. These interactions cause transit- 
ions between the different quantum states. We assume, as usual, that the 
equation for p is linear in p. Then the equation for p must be of the following 
form 


t t 
pit) = plta)+] K%2) pact] de | Y KuwelG ) Pu 0) Pee 
to 0 iki’ k’ 


+} Kip, 2) Pix e@) ape Ki? (t,t) PC) Piz (496) 
whete fp is the initial time. As a consequence of causality the integral only runs 


t J.R. Senitzky, Phys. Rev. 161, 165 (1967). 
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from fp to t. The most general form in which the projection operators can 
occur is given by (496) because higher powers of the projection operator can 
always be reduced according to (466). Provided the kernels K depend only on 
t (i.e. K = K(t)), eqn (496) reduces to an ordinary density matrix equation, 
which has been introduced, for example, in the theory of magnetic reson- 
ance’) or in the laser.‘ This equation is 


dt > Rinne (t) Pix p(t) Peet) Ki (1) Pix pO) 


+3, Ku (t) p(t) Pint KO (1) p(@). (497) 


5.4, EXACT GENERALIZED FOKKER—PLANCK EQUATION’) 
We define the distribution function byt 


fv) = N [ exp( = py Viz Xn) u(T] exp(%j: Pix) e) {dx;,}, (498) 
t t 
where N is the normalization constant, and x,, are classical complex quantities 
for which 
Xin = —Xu*. 


The v,,’s are classical variables which are associated with each projection 
operator: 


. a * 
Dig PP 4 Vin = VEG: 


Most important for what follows is the sequence of the exponential operators 
which occur in the trace. We order these operators in the following way 


O = 0, Oy Op = Il EXD (Xi4 Pix), (499) 


where 


O, = (1, 2)(1, 3)(1, 4)... (1, (2, 3)(2, 4)... (2,72)... 
...(n—2,n—1)(n-2,n)\(n—1,n) (500) 


Ou = (1, 12, 2)... (4,7), (501) 


Op = (n,n—1)(n,n—2)(n—1,n—2)...(n,2)(n—1, 2)... 
vn Bs 20 Dee(2, 1). (502) 


{ This type of distribution function has been introduced by Haken er al.‘6*) where instead 
of P,,, the usual creation and annihilation operators of Fermions a,t and a, were used. In the 
one particle subspace a,t a, just obeys the relation (498). The definition (498) secures that f 
is a real function (which is not always positive, however). (498) allows for a number of 
modifications and extensions, 
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We have used the following definitions 


i<k, Xix = Six 


(3, k) = exp Ci Pi) = (1 +in Pix) (503) 
i=k, Xin = Su = i 

(i, k) = exp (¢;; Pi) = exp (4; Pi) (504) 
i>k, Xin = Ein 

(i,k) = exp (64, Py) = (1+é4 Pix) (505) 


The definition (498) allows us to calculate quantum mechanical expectation 
values by ordinary c-number procedures. For instance the expectation value 
of P,,, is given by 


tr (Pyy ) = | Se | dm £(0), (506) 


where the integration runs over all the variables v;,(= v,;*) over the whole 
complex plane. 

If p(t) obeys the usual density matrix equation (497) (which is of first 
order in its time derivative, the heat baths being Markoffian) it is even possible 
to calculate multitime averages of the form 


tr (Pin, (t1) Pijn,(to)- - Pix (tn) P(to)), (507) 
with 
th >t >... > by 


by means of the single time distribution function. 


The equation for the distribution function 


THEOREM: The exact generalized Fokker—Planck equation belonging to (496) 
and (498) is given by 


FO) =f (to) + I " KL, a) f (2) de 


t 
+ { ae{ Y Kier (6 0) py My ik,ay(X) Van 
i # 


‘0 wk’ 


+) Ky Ot, t) > Nie an(X) Vip +) Kit, t) by Nan) P| F(a). 
ik Au ik Ap (508) 
This equation is to be supplemented by the constraint } v,, = 1. Using the 

rule 
Xin > —d/é Vike (509) 
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we define the quantities M, N as follows: 
Miwijan = 2 Aitenn Burin Cratty EXP (Xnn— Xn) 
mn 


where 
Aikmn = Oim O; vy —Xmi Om<i o,*"", 


Bitnn = Skn OR” '— Xin Sn<k OR”, 
Om<i = 1 form < iand = 0 otherwise. 


Om * = Smt Y Xmty Xyty + Myke 
The summation runs over all ways from m to & with 

m<l<l,...<k fork2m 

OFF" =0 form >k 


OR”! = Sma t DY Xt; + Xiah; Xue 

The sum runs over all ways from /to m with 
m>l>tiy...>h>1>Ilform2!1 
O,"*' = Oform <1 


See ee 


Cikim = 6i(Onm—Ox<m Xm)» i< k, l< mM, 
Cik,tm 7 Diitm A 641 Sim i=k,l<m, 


(510) 


(511) 
(512) 


(513) 


(514) 


Eitam = Oxm(i— Xi O:<) EXP (— Xie + Xun), i>k,l>m, (515) 


Crktm =0 otherwise 


F item = Sim(O km — Xtm Om< i)» m= n,i z k 
, Higmn = Sim kn — Xin On< ) 


+ Xin On<m OinO Kn — Xkn On< x)» m>n, iz k (516) 


N ikmn 
\ Bitm'n! exp (Xin'm' —Xn'n') Cs i< k, 
mn’ 

NO, = Ofori>k,m<n 


| Fr = 931 O41 Xp 91 <5 Ox» l=mi<gk 
(2) 


Fittm = Spm (Ou — X11 91<i) 
+644 Xm O1<m(Ou— X14 91<i)s l<mi<k 


ikim >= 
oF Aikmn’ Cains i> k 
m’n’ 


Ni, = Ofori << k,l > m. 


(516a) 


| (517) 
| 
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Because the operator acting on f on the right-hand side of eqn (508) is linear 
in v,,, the Fourier transform of f obeys a partial differential equation which is 
only of first order in its derivatives. Thus the solution of this equation may 
offer certain advantages over the direct solution of (508). 


Derivation of equation (508) 


We first multiply the density matrix eqn (508) with O from the left and 
then we take the trace over the total system. We then have to calculate the 
following expressions: 


tr (OP, pP rir), (518) 
tr (OP; p), (519) 
tr (OpP;,). (520) 


Using the cyclic properties of the trace we can rearrange (518), (519) and (520) 
in the form 


tr (Per OP, p), (521) 
tr (OP;, p), (522) 
tr (Pi, Op), (523) 
respectively. 
LEMMA |: 
0,1 = (1-¥ éaPa)- (524) 
i<k 
Proof: 


We denote the inverse of (i, k) by (i, &) so that 
O,-* = (n—1,n) (n—2,n)... (41,142)... 
GG m+2HLm+ DIG mG, m—1)...0,2) (525) 
ifm+1 <n. 
If m = n, we have instead of (525), the equation 
OL = (— 1, nn—2,n)... C4114 GFT DIGMGn-1)... 626) 


We assume that the expression (524) is correct up to the indices (/, m) i.e. up to 
the bar, so that we may write 


Oo. = (1—€,-1.0 Pa-t28 fatale: 


—C4m+1 Pim+ it~ Sim Pim)» ++ (527) 
ifm+l <n. 
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If m = n, we have instead of (527), the equation 
0,"* = (J Gay gtl ads a — Cin Pix 20 
— Cr+ 1414+2P i+ 1142101 — ern Pin) snes (528) 


We now investigate the left neighbour of /, m. According to (525) and (526) 
the left neighbours have the following indices 


(1, m+1); (,m+2) (529) 
or 
(+1, /+2), 7+1, /+ 3). (530) 
In any case, we have always 
m+1,...1>1 (531) 


so that the following inequalities for an arbitrary pair (7, kK) occurring on the 
left of the bar result 
izl 
~k>I1 (532) 
k>i 
From the definition of O; we have further m > |. From these inequalities it is 
evident that k is not equal to / and thus always 


Pix Pim = 0 (533) 
This proves our assertion up to (/, m—1). 
LEMMA 2: 
0,-* = (1- » zn Pu: (534) 
i>k 
Proof: 
analogous to that of Lemma 1. 
Definitions: 
we introduce the decomposition: 
O, = OFT (Lm) OFF, (535) 
where 


om = (1, 2)(1, 3)... (1, #)(2, 3)(2, 4)... (2,n)...(,m—1), (536) 
ifm > [+1 
and 


om = (1, 2)(1, 3)... (1, (2, 3)(2, 4)... (2,0)... 7-1, n—1)U—1,n) 
(537) 
ifm = 1+1. 
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om, = (,m+1)(, m+2)...(n—2,n—1)(n—2, n)\(n—1, n) 


ifm+l<n 


= (/+1,/+2)(+1, 143)... 


ifm =n. 


In a similar way we define 


Og = Oxr (I,m) ORR 


(n—2,n—1)(n—2, n)\(n—1, n) 
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(538) 


(539) 


(540) 


o'm = (n,n—1)(n,n—2)(n—1, n—2)(n, n—3)(n—1, n—3)(n—2,n—3)... 


if/+1 <n, 


om, = (n,n—1)(n,n—2)(n—1, n—2)(n, n—3)(n—1, n—3) 
.(n—2,n—-3)... 


if] =n. 


om, = (I—1, m)(I1—2,m)... 
...(m,m—1)... 


if7—1 > m. 


oim, = (n,m—1)(n—1,m— 


if? = m+. 


1).. 


Simple relations for derivatives 


Evidently we have for / < 


with 


for/=m 


m 


00 
OC tm 


a0, 
lame 


60 


Oe, 


(m+1, m)(n,m—1)(n—1,m—-1)... 
(n, 1)(n—1,1)... 


.(m, m—1)(n, m—2)(n—1, m—2)... 
.. (4, 2)(3, 2)(n, D(n—-1, 1)... 


_ 00, 
BE 


= OF t Pim OTR; 


Ou 
* Of," 


(m+3,m+1)(m+2,m+1) 


= Ou Or, 


(541) 


(542) 


(543) 


(544) 


(545) 


(546) 


(547) 
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where a0 
BE, = Pu Ou = Om Pus (548) 
nu 
forl>m 
616) 0OR 
—s— = 0, Oy ——; 549 
aE, — Ov Om az, ay 
with 
oO 
ae = OF, Pin Off: (550) 
Om 


We now come to the task of expressing (521), (522) and (523) by derivatives 
of O with respect to x,, (or €,,). We do this in several steps: Rules for shifting 
projection operators and use of derivatives 


LEMMA 3: 
P;,O0 = > Aizmn O, Prin Ou Op, (551) 


where A; jn, is defined by (511) and (513). 


Proof: 


the equation follows from the fact that: (1) the P,,,’s span the basis of an 
algebraic ring, so that we can always express a function of the P,,,’s by a linear 
combination of the P,,,’s and that (2) O,, O,,, Og possess inverse operators. 
We multiply (551) by O,~* from the left and by O,~' O,,~* from the right. 
This leaves us with the eqn 


0, * Pix OL = > Aitmn Pa (552) 


When we take the matrix element with respect to the states ¢,, and ¢, on both 
sides, we find immediately 


Aes (4_(Pu- éusPox) Or é,) (553) 


where we have inserted the explicit form of O,~ 1 according to eqn (524). We 
further introduce the abbreviation 


Op" = bm Or bx). (554) 


Expanding O, into the powers of P;, and using the property of the projection 
operators we immediately find the formula (513). 
Introducing (554), (513) into (553) we obtain (511) 
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LEMMA 4: 


OP i; 7 by Bikonn 0, Ou Tes OR (555) 


with Bipmn given by (512) and (514). 
Proof: 


analogous to that of Lemma 3. 
We now express O; Pn, Oy Og by a linear combination of derivatives of O 
with respect to x;,. 


LEMMA 5: 
Fori<k a 
O, P ik >= Sy Ciktm ae O, (556) 
i<m Cm 
with Cipim given in (515) 
Proof: 
Using (546) we write (556) in the form 
O, P ik = oe Cikim Or P tn Ofr (557) 


By multiplying (557) with 0, ~? from the left and from the right we obtain 
Pi,0,7>* = Y) Cittm (O%R)~ * Pim (O81)? (558) 
l<m 


The right-hand side of eqn (558) follows from 


O,”* OFF, = (OfR)* Gm), (559) 
Ofer O,* = (m\(Of)"', (560) 

(compare (535)) and 
(1, m) Pim (i, mM) = Pim, for 1 # m. (561) 


For the further evaluation of the operator product on the right-hand side of 
eqn (558) we note that 


d ee dhs 
~ ae Ol) Gm) On 
Im 


(o¥%)-* P,, (OlN)~! 
(562) 

d 
~ 4g, 7 


= 
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Due to (524), (562) reduces to 


(OTR) * Pim (OF) * = Pim: (563) 
Thus eqn (558) takes the simple form 
Pix 0,"* 7 > Ciktm Pim (564) 
l<m 


Taking the expectation value with respect to the states / and m on the left- and 
right-hand side we find immediately 


Ciktm = (4 Pu (1 | a ca Pas) 2) (565) 


Cittm = 911 8pm —9it S km Ok<m (566) 


LEMMA 6: fori =k 


O, F ii Ou = >» Di, (O, Om) (567) 


im = 


with D given by (515). 


Proof: 
Use of (546), (548) yields 


O, Pi, Oy = p> Diam OF Pim O'R Om +) Diy O; Py Oy. (568) 


We multiply (568) from the left with O,~* and from the right with O,,~* 
O,~ ‘. This leaves us immediately with 


P, 0,7 = y Ditm Pim+ >, Din Pn 0,~* (569) 
i<m I 
which, by use of (524), becomes 
P,O,"* = 2 Pin (Dittm — Din Sim) +d Din Pu (570) 


after the indices have been rearranged. We first take the expectation value 
with respect to the states /, / which yields 


Diin ar (4. ti (1- 2 Cav Py») 4 ‘) = 61-9 dz ow dy, (571) 


We now investigate / 4 m. Taking the matrix element on the left and right 
hand side with respect to the states /, m, we find 


(4 Pa(1 4 2, as a) ta) = (Diitm at Diin Cim) Oem (572) 
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and by use of (571) we see that 
Dim = 0. . (573) 

LEMMA 7:1 > k 
O, Pi, Ou On = pay E; 


FEO =F Ean Or O, (574) 


fs) 
Kim O mee 
waar am Obes 


with E,,;,, defined in (515). 


Proof: 
Analogous to that of Lemma 5. 


A second way of shifting projection operators: 


In the foregoing analysis we have first shifted the projection operators 
from the left or the right to the middle of the product O; Oy Og. Then in the 
next step we have shifted the P,,,’s to their places which would correspond to 
their order in the decomposition (535), (540). Under certain circumstances it 
leads to simpler formulas if we immediately shift the projection operators to 
their correct places. 


LEMMA 8:i <k 

é 00m 
Py,0, Ou = ¥ Fitim (| 01) Out) Finn O. <" (575) 
i<m Oem i clan 


where F;,;,, is given by (517). 
Proof: 

Multiplying ae expression from the left with O,-1 and from the right 
with O,,~* O,~*, we obtain in the same manner as above, the equation 


0,71 Py = 2 Fitim Pmt 2, Fin Py OL *, (576) 
and using (524) we get 
0,7 Py = p> Pan (Fiztm — Finn Sim 81<m) +h Fign Py. (577) 


Equation (577) can be rearranged in the form 
O LP = 2» Fittm Pim +) Fin Pu (i- » Cs Pyx) (578) 


where use of (466) has been made. Taking expectation values on both sides of 
eqn (578) with respect to states /, m, we obtain immediately (517). 
LEMMA 9: (i > k) 

| O(Om Op) 


ae. (579) 


Ou On Pix <e Dy Finn 
men 


where Hin, is defined in (516). 
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Proof: 
Analogous to that of Lemma 8. 


Evaluation of (521), (522), (523) 


We are now in a position to put our previous results together in order to 
calculate the expressions (521), (522) and (523). 


1. Evaluation of (521) 


Py, OP yy = ¥, Aikmn O1 Pmn Om On Pri (580) 
(compare (551)) 
Using 
OnPev = ¥. Bermy Purw On: (581) 
we obtain 
Pip OP yy = - ba Aitnn Byirm'nt 91 Pin Ou Pn Op: (582) 
Using 
Ou Piva! = Pin Ou exp (Ca —Mn')> (583) 


we transform (582) into 


Pix OP yy = py ys Aiton Byimn' OL Pi Pay’ Ou exp (Ym —Nn) Op, (584) 


mn m’n’ 
which by use of (466) reduces to 


> Aika By'itnnt 91 Pn On EXP (Nn Nn) Op- (585) 
mnn’ 


According to (556), (567) and (574) we have, quite generally, the equation 


7) 


‘, 
me OXay 


0, Paw’ Ou Og = » Ca 0, (586) 
AL 
where 
Chnan for m<n',a< yp 
Comin =\ Dray for m=n',A<p (587) 
Emnjan for m>n',A> yp. 


=0 otherwise. 
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Inserting (586) into (585) yields 


0 
Py, OP yy = Aitkmn Beinn Croeig eXp (Y4,— Mn) a.. 0. (588) 
mann’ Ap Xan 
Thus, we obtain finally 
é 
Py, OP ey = Y Miwiap (x) Dx. O, (589) 
Au Xap 
with 
Mikeitan = » Aiton Beir Cinta EXP (Ny — M1): (590) 


The coefficients A, B, C, are given by the expressions (511), (512) and (515). 


2. Evaluation of (523) 


We distinguish between the cases i < k and i > k. If i < k we apply (575) 
If i > k we first use (551) which yields 


P,O = y Aikmn O, I va On Op. (591) 
Inserting 
0 
O, a Ou OR = >; Cry OX.w O (592) 
ml Xm'V 
into (591) we find 
61 0) 
P; = ZF. 3 
k O > OX im 
where 
T im = y Aikm’n' Cie: (593) 


3. Evaluation of (522) 
In a similar way we treat OP;,. For i > k we apply (579). For i < k we have 


OP; = O. Ou os Bitmn Ps OR. (594) 


We find 
OP;, = »y Birnn 0, y gt exp (Ym —"n) Ou Op- (595) 


Inserting (586), that is 


ae, (596) 


’ 
OXtm’ 


0, P. mn Ou Og ai >» C wat'a 
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into (594), we obtain 
cee ee ey ee 9 
OP ix Din m OX O (5 7) 
with 
Nigrm’ = >» Bitmn exp (Nm = Nn) Creatas (598) 
mn 
Final step 


We insert the explicit expressions for (521), (522) and (523) into the 
equation 


te(Op(0) = tr(Op(a))+ [dr {  Kuve(t 2) (Prv OP 00) 
+9) Ky (t, 2) tr (OP, (0) +), Ky (t, 1) tr (P.O p(e))| . (599) 


In order to obtain an equation for the distribution function f, (498), we multi- 
ply (599) by 


exp =) Vik x] 


and integrate over the space {x,,}. Thus the left-hand side of (599) becomes 
identical with f(t). The traces on the right-hand side of eqn (599) have the 
form of derivatives of O with respect to x,;,, which are multiplied by functions 
of x;,. These operations can be translated immediately into ones with respect 
to the variables v;,, if the following wellknown rules are observed: 


0/0x;, + multiplication by v;;, 
multiplication by x, > — 0/dv,; (600) 
Having these rules in mind we immediately find eqn (508) which was presented 
in 5.4. 


In conclusion we discuss a constraint which stems from the normalization 
of any quantum system, which is expressed by 


» Pi = 1. (601) 


We include (601) explicitly in f (498) by the following new definition of f, 
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f= nf fdxj,} exp | . Viz xa] 


1 ro 6) 
xs [ dé exp (—i€) tr (0. exp (zyP,) Ou Or e) (602) 


Having in mind that Oy = exp {x,;P,;} we introduce the new variables 
Xj; = Xj; +i¢ which leaves us with 


f= v{ {dx;,} exp fe Vix x] exp [-5 vj iy} tr (Op) 


J 


xs [exp (« (x Vjj- ') dé. (603) 


Evidently the definition (498) of f agrees with that of (603) if the constraint 
> vj; = lis added to the definition (498). 
i] 


Many subsystems: projection on to macroscopic variables 


Our eqn (508) applies to a single atom with two energy levels, or a spin}, 
to multi-level atoms, as well as to a complicated many-body problem. For a 
discussion of its solution it is thus certainly necessary to treat specific ex- 
amples.f On the other hand, under certain circumstances it is possible to per- 
form strong simplifications even in rather general cases. Indeed, in many cases 
of practical importance, the system under consideration consists of many 
equivalent subsystems. Examples are spins in external fields, or laser atoms 
interacting with the light field. In these examples the individual subsystems 
interact with the external fields in a similar way, i.e. by their total (macro- 
scopic) electric or magnetic moments (after certain phase-factors have been 
transformed away). In these cases it is advantageous, first to introduce pro- 
jection operators for each subsystem yp: P,,,, and to formulate the density 
matrix equation by means of these P;,,,,. It reads 


p(t) = Lott), (604) 


where L is a linear operator (see eqn (496)). In our present case, L consists of 
asum over the subsystems: 


L=>L,(Piw) = YL Pity) (605) 
B B 
where the latter equation expresses the fact that the subsystems are equivalent. 


f e.g. in the case of a two-level atom, f is built up of 6-functions (W. Weidlich, private 
communication). 
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We now introduce macroscopic variables 
Vin = » Vikn (606) 
B 


and define 


f() = ( {du} exp { ~Y xu Vi} t (1 0,) °) (607) 
where 


0, = I] exp (Xi Pixy)- (608) 
ik 


Because the O,’s commute, we may write 
[]9, = [| 9, .0,, 
v v 
where (4) indicates, that the factor v = p is to be omitted. We multiply 


eqn (604) from the left with O and take the trace with respect to the whole 
system, thus 


tr (Op) = tr (OL p(z)) = by tr (OL(Pix,)p) 


= » tr ( rr 0,) tr, (O, L(Pizy) »)| ; (609) 


It has been shown that 


tt, (0, £(Pa)p) = A xu, : Je, (O, p). (610) 


OX, 


Note that O, is a function of x;, due to (608). The right-hand side of eqn (609) 
can thus be written in the form 


(609) = 2 tr |” 0,) h (xu, x] tr, (O, | (611) 


=Str [ne O,.h (xi a) 0, P|. (612) 
73 i 


Note that we can always use the cyclic properties of the traces, if needed. 
Now let us make use of the fact that / is of first order in the derivatives 
6/0x, (compare eqns (508) and (600)). From Leibniz’s rule for differentiating 
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products, we immediately find 


tr(Op) =h (xu, oa tr (TI 0, p). (613) 


Multiplying eqn (613) with 


wx (-E 7%) 
ik 
and integrating over x;,, we obtain 


7) 


1 Vw!) = 1( Yaw 55- 


)£ Vin (614) 
where y is a linear operator defined on the right-hand side of eqn (508), if the 
UVj,'S are replaced by the V;,’s. A simple analysis shows that the condition 
>) 04 = 1 is now to be replaced by ¥ V;, = N, where N is the number of sub- 
i 


systems. The importance of the result (614) is that while f (v,,, ¢) for a single 
subsystem may show wild fluctuations (e.g. it consists of 6-functions), the 
macroscopic variables V;, possess a smooth distribution, which changes but 
little. Thus in many cases of interest it can be shown that we may neglect 
derivatives higher than second order, so that (614) then actually reduces to an 
ordinary Fokker~Planck equation. 


APPENDIX 


QUANTUM MECHANICAL LANGEVIN EQUATIONS’ © 


We shall discuss the origin of quantum mechanical Langevin forces for the 
field (the effect of heatbaths). 


The light field 


We shall show by means of an example, which can be treated exactly, 
how quantum mechanical Langevin forces arise. For this end we couple the 
light-field to a heat-bath which consists of an infinite number of harmonic 
oscillators. Thus the Hamiltonian of the bath has the form 


> ho B,* By. (615) 


The bath variable occurring in the interaction Hamiltonian (297) is chosen as a 
superposition 


B=h¥ ga Bas (616) 
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or, in the interaction representation, 


Bit) =hY¥ 9, B, exp (iot). (617) 


The total interaction Hamiltonian has the form» 
Aeieia van = Bf Iw B. exp (iat) +b by Jo* B,' exp (—iot). (618) 


The Hamiltonian of the free field is given by 


ha, bt b. (619) 
The Heisenberg equations for b,* and B,‘ are 
db'/dt = iw, b'+ i> g,.* B,* exp(iat). (620) 
and 
dB,,'/dt = ibt g., exp (—iwt). (621) 


Equation (621) has the solution 
t 
BJ =i | bf (t) 94 exp (—iwt) dt+B,* (to), (622) 
to 


where B,,'(to) is the operator at time fy. Inserting (622) into (620) we arrive at 


t 
— = iw, bt- i bt(t) ¥ Idol? exp [io(t—1)] dt +i ¥ g,,* B,* (to) exp (iat). 
ee 2 (623) 


We now assume that the g,,’s are of about equal amplitudes so that we have 
oo ; P 
i lg|? exp [io(t—1t)] dw = K[a0—0)+ cE (624) 
0 mT t—T 


k = nlgl? 


If we allow also for negative frequencies, then for harmonic oscillators with an 
inverted energy scale we would find, instead of (624), the equation 


| ar exp [iw(t—1)] dw = 2x d(t—7). (625) 


For the evaluation of the integrals it must be noted that the 6-function has the 

property z 

| spades: (626) 
0 


Equation (624) corresponds to the choice made by Senitzky, whereas the 
choice (625) was made by Lax. We see that (624) is more physical but (625) 
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has certain computational advantages. The actual difference between (625) 
and (624) is negligible in practical calculations. Under these assumptions 
(623) becomes 


dbt : . 
Gp = io bt bt +i » Jo* B,' (to) exp (iat), (627) 
F'(t) 


where the last term is evidently the fluctuating force. 
We determine the properties of this fluctuating force and evaluate 


CFI) F(t)>s = Y gal? exp Lio(t—1’)] tr (B,t By exp [Hy /kt)Z,.~! 
=F Ig exp E—io(t—1) 8 (1) “e) 
and Peed 
FOF» = Yigal? expL—io(e—1] HoT) +1), (629) 


where fi,, is the occupation number of the mode at the frequency w of the 
heatbath. The bath average over the commutator has the form 


fens |g Pp 
x(8¢0-1 d+ <3) (630) 
2x6(t—t") | 


Equations (628) and (629) can be directly expressed by the damping 
constant x only for the temperature T = 0. For higher temperatures it must 
be noted that in practical calculations 


<F'(t) F(t’)> = Y lol? exp [io(t—t’)] <B,! B.), (631) 


CFO FGDs = ( 


and 
<F(t) F4(t’)> = ¥ lgal” exp [iw(t—#’)] XB, BY, (632) 


always appear under an integral which contains a factor exp (iw@of), where wo 
is the frequency of the harmonic oscillator (light field). If this is taken into 
account, (628) may be case into the form 


» Igal” exp [io(t—t')] #,(T) = 2x 1,,(T) d(t—t’), (633) 


and dGiiadty 
(F(t) Ft(t’)> = 2x(H,,(T) +) 6(¢—-t’). (634) 
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The 6-function in (630) and (633) expresses the fact that the heatbath has a 
very short memory. As may be seen from (623) and (624) it is essential for the 
derivation that the heatbath frequencies have a spread which covers a whole 
range around the oscillator frequency wo. We solve eqn (620) explicitly 


bt = bt(0) exp (imp —Kk) t+ I exp [(i@) —K)(t—1)] Ft(t) dt (635) 
0 
and get 


© (bt yy = <bt +B bys 


ll 


—2x<bt b>g+ [<r F(t)>, exp [— tics, +K)(t—t)] dt 
i) 


+ { <Ft(t) F(t)>, exp (ia) —K)(t—1) dt. (636) 
The integrals may be evaluated using (633) and (634) so that we find 
s <bt b>, = —2K<bt bg +2Kn. (637) 
Tn analogy to (636) we form 
<b, bY}>s = —2«<[b, bY]> + 2k. (638) 
The general solution reads 
<[b, b'}>,, = exp (—2«t)C+1. (639) 


If we insert the initial condition 


<6, b')>g0 = 1, (640) 
we find 
C=0, (641) 


which means that the commutation relation is preserved for all times. 


Oo © ~I 


10. 
11. 
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Optical Resonators 


G. TORALDO DI FRANCIA 


1. [INTRODUCTION 


The threshold value of the pump power in a typical laser is given by 


WwW phy 


vamp = 
phot 


where p represents the number of modes which can interact with the emission 
line of the material, @ is the quantum efficiency, and f,,,, denotes the time 
during which a photon can interact with the material. W,,,,, can be reduced by 
making p small and f,,., large. Such favourable conditions can be achieved by 
using a resonant cavity having a high value of Q (= 27Vvt,,,.) and chosen so 
that only a small number of modes (possibly only one) interacts with the 
atomic systems. Resonant cavities have been widely employed in microwave 
devices and these techniques have now been applied to optical systems. 

A closed cavity with perfectly conducting walls has an infinite number of 
resonant modes of different frequencies. If we want only one mode to reson- 
ate within a fairly wide frequency band, the size of the cavity must be of the 
same order of magnitude as the wavelength, and it is practically impossible to 
achieve this at optical frequencies. When the size of the cavity is much larger 
than the wavelength, the number of modes per unit volume which can reson- 
ate in a frequency interval dv is (8xv?/c*) dv. For example, a 1 cm?® cavity 
used at optical frequencies would have more than 10® modes, even within a 
bandwidth, dv/v, of 107°. 

However, a closed cavity with perfectly conducting walls represents a very 
idealised situation. In order to use the field inside the cavity, it must have an 
opening through which energy can be transmitted to the rest of the system. 
If the losses are high, the value of Q is reduced and the width of the resonance 
curve increases. If Q is small for a whole set of modes whose frequencies are 
closely spaced, then the separate frequencies will merge into a continuum. 
Strictly speaking, a lossy cavity does not have discrete modes, but only pseudo- 
modes or virtual modes. The situation is analogous to the case, in quantum 
mechanics, of a system of particles with positive energy. Roughly speaking, 
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the meaning of a pseudo-mode is that when a photon is in such a state, it has 
a high probability of staying inside the cavity for a long time. In the case of an 
ordinary microwave cavity with waveguide terminals, however, it is still fairly 
accurate to speak of discrete modes. 

The situation is different for an open resonator, i.e. for a cavity where a sub- 
stantial part of the walls has been removed. A large majority of the modes 
have very low Q and merge into a continuum. However, it was pointed out by 
Schawlow and Townes") that if the size of the cavity is very large compared 
with the wavelength, as necessarily happens in optics, a few particular modes 
may have relatively low losses, due to the directivity of the radiation. The 
triple infinity of modes which can resonate in a closed cavity reduce to a single 
infinity, and it may then be possible to have only one or a few modes resonat- 
ing within the atomic linewidth. 

The active material filling a laser cavity may substantially influence the 
modes, but for most cases, we will ignore this influence and consider the reson- 
ator to be situated in free space. 


2. THE FABRY-PEROT CAVITY 


The above reasoning was first applied to a Fabry—Pérot cavity, Here the 
radiation bounces back and forth between two parallel flat mirrors. One or 
both end mirrors are made slightly transparent, to provide a coupling with 
external space. 


Waveguide 


Fic. 1. Diagram showing the two mirrors of a Fabry—Pérot cavity forming the end bases 
of a waveguide, of length d and square cross-section of dimension 2a. 
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An approximate theory of this multimode resonator was proposed by 
Schawlow and Townes.“ They assumed that the two mirrors form the end 
bases of a section of a waveguide with metal walls (Fig. 1). Let the wave- 
guide have length d (along the z-axis) and a square cross-section of dimension 
2a (in the xy-plane). The frequencies of the modes of oscillation of this 
rectangular cavity have the following form 


c q\? r\2 5s \2}4 

r= a(S) + (&) + Ga) | sa 
where q, r, s are positive integers, c is the velocity of light and 7 is the refractive 
index of the medium. Each mode can be generated by a plane wave reflected 
back and forth between the walls ) and characterized by the direction cosines 
+rd/4a, +sA/4a, +qA/2d, where J is the wavelength in the medium. 

For the present purpose, we need consider only those waves which make a 
very small angle with the axis of the waveguide and which radiate almost end- 
on through the slightly transparent mirrors. For such waves, g is approxi- 
mately equal to the number of half wavelengths contained in d, and has a very 
large value (10°—10°), while r and s are small. In this case, if the ratio 2a/d is 
not too small, (2) can be written as 


c (2 r°d s*d ) 


DA — 


a 8qa? oe 8qa? 3) 


If we denote the reflection coefficient of the mirrors by «, a nearly-axial wave 
will decay in a time ¢ given by 


nd 


t= W=ae é (4) 
Hence 
2nd 
Q= ry (5) 


If a = 0-98 and d = 1 cm, Q is of the order of 107. 
Now we have from (3) that two modes with values of g differing by 1 will be 
separated in frequency by approximately 
ee 
Av, = —. 6 
Ya = Ond (6) 
If « is close to unity, it follows from (4) that Av, > 1/t. This means that suc- 


cessive axial modes are discrete. If the bandwidth of the transition is less than 
c/2nd, it will be possible to set a single axial mode into oscillation. Alterna- 
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tively, discrimination against unwanted axial modes can be achieved by an 
extra reflecting surface suitably positioned outside the laser.©? 

However, modes with the same value of g and successive values of r are 
separated in frequency by an amount, 


(r+3)d 
=< 


Ay see 
8nga* 


B (7) 
and a corresponding result may be found for s. A typical value for d/2a in a 
solid state laser is 10. Hence, with « = 0-98 and r of the order of unity, we 
have Av, <1/t. This means that the lateral modes are almost degenerate and 
may merge into a continuum. They may therefore be set into laser oscillation 
simultaneously. 

In principle, lateral modes may be separated by their directional properties. 
Each mode will radiate into four directions, characterised by the angles 
+rd/4a, +s5/1/4a. Hence the directions of radiation of the modes form a regular 
array of spacing 4/4a. However, the finite size of the end apertures will give 
rise to diffraction, each beam having a half-width given approximately by 
A/4a. It would therefore be just possible to isolate a single mode by focussing 
the radiation onto a screen with a suitable pinhole. 

Let us now remove the side walls and deal with a true Fabry—Pérot resonator. 
For small values of r and s, the modes are very nearly the same as those of the 
closed cavity, the only difference is that, due to the absence of the side walls, 
they continually spill energy into external space. This will reduce the Q of all 
modes, and the effect will be more pronounced, the higher the values of r and 
s. The resonator may loose energy in two ways; firstly, by reflection, oblique 
waves after being reflected back and forth will gradually walk off the end mir- 
rors, and secondly, by diffraction. 

The time ¢’ for a beam to walk off the end mirror is readily found to be 
given by 


8 Zz 
ae: (8) 


rca 


This is equal to the decay time ¢, given by (4) when 


8a7(1— 
pa eae): 


Ad @) 


For a = 0-98, d= 10 cm, 2a = 0-5 cm, A = 5 x 107° cm, we obtain r = 20. 
Hence, modes with r > 20 have a Q that is less than half the Q of the funda- 
mental mode. If the pump power is not too high, these will be below threshold 
and will not oscillate. However, we can conclude, even before taking diffrac- 
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tion into account, that a few tens or hundreds of lateral modes are likely to 
lase at the same time. 

The first satisfactory treatment of the Fabry—Pérot resonator was given by 
Fox and Li.” They treated the waves as scalar quantities. This approach can 
be justified since the size of the mirrors is large compared with the wavelength 
of the light and all angles of interest are sufficiently small. 


Fic. 2. Diagram showing the arrangement of mirrors, M, and M2, of a Fabry-Pérot 
resonator, 


Their treatment can be summarised as follows. Let us assume that the two 
mirrors M, and M, (Fig. 2) are identical rectangles. If we start with a field 
distribution u,(x,, y,) across M,, by Huygens’ principle we will obtain a 
distribution u2(x2, y,) across M,, given by 


exp a 


tia (25 V2) = a tis :) ee cos 0 ds, (10) 


where @ is the angle between P,P. and the normal to the mirrors, r denotes the 
distance from (x,, y;) to (x2, ¥2), k = 22/4 and a factor exp (—iwt) has been 
omitted. 

The normal modes or eigenfunctions of the resonator are obtained by re- 
quiring that the field distribution over M, reproduces itself over M,, so that 
Uy = Ou,, where o is a constant factor. As a result, we obtain the integral 
equation 

oUu(X2, V2) = =| u(x, V4) ae, cos 6 dx, dy,. 653) 
M1 r 
Each eigenfunction u represents a mode of the resonator and the eigenvalue o 
specifies both the attenuation and the phase change in one trip across the 
resonator. 
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When the mirror separation d is much larger than the lateral dimensions 
a, b, eqn (11) can take the simpler form 


t 


ou(as¥2) = a5 | and exp | ik 


(x1 ~X2)? +(%1 — 2)? 


7 | dx, dy, (12) 


where a factor exp (ikd) corresponding to the normal optical path has been 
omitted on the right hand side. The argument of the complex number o will 
now be called the phase shift. 

Equation (12) is valid when a?/dA < d*/a*, where a represents a typical di- 
mension of the mirrors. If we introduce the Fresnel number 


a 
N= —, 13 
ql (13) 
we can write this condition ast 
dz 
N< a. (14) 


By introducing the dimensionless variables €, , = V N 22 N12 =v N oft: : 
eqn (12) becomes zs 7 


ou(E,, 2) =i iv u(Esm1) exp {mE —E>)? + (a —ma)*T}d2, dn, (15) 


where .@, represents the domain of &,, 4, space corresponding to the domain 
M, of x,, y, space. It can be seen by inspection that for a set of geometrically 
similar mirrors, eqn (15) depends only on N (through ./#,). 

In the case of the rectangular mirrors of Fig. 2, we can separate the variables, 
by putting u(é, 4) = u(€) v(n). The integral equation for u becomes 


+JN 
cus) = exp (—in/4) |” u(@,)exp Lin, -EY'1db. (16) 


It is obvious that u and v represent solutions for infinite-strip mirrors, parallel 
to y and x respectively. Thus, the eigenfunctions and eigenvalues for the rect- 
angular mirror are products of the infinite-strip eigenfunctions and eigen- 
values respectively. Specifying the different eigenfunctions by subscripts, we 
shall write un, (E54) = Un(E) v, (4) and Onn = CnFn: 

Equation (16) is a homogeneous Fredholm equation of the second kind with 
u(€) as the kernel. The eigenfunctions are orthogonal over the surface of the 
mirror. 


+ Actually a closer investigation shows that condition (14) is not as strictly necessary as might 
appear. Equation (12) turns out to be valid under much more general conditions.6-7 
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Fox and Li® have solved the integral equation numerically, by an iteration 
method. Starting from an arbitrary initial field at M,, one can compute the 
field produced over M, by Huygens’ principle. This new field distribution is 
then used to compute the field produced at M, as a result of the second transit. 
By repeating this calculation over and over again, one finds that the relative 
field distribution approaches a steady state. Depending on whether the initial 
field is symmetric or antisymmetric, one ends up with the lowest symmetric or 
antisymmetric mode. 

Figure 3 shows the results obtained by Fox and Li for the lowest even 
(symmetric) and odd (antisymmetric) modes, for different values of N. The 
power loss and phase shift are both found to be very rapidly decreasing func- 
tions of N. At resonance, the separation d of the mirrors is an integral number 
of half-wavelengths plus a very small additional amount corresponding to the 
phase shift. The numerical analysis has confirmed that the wave guide ap- 
proach is a fairly good approximation and can help to understand several 
qualitative features. 

A very elegant approach to the theory of the Fabry—Pérot resonator was 
given by Wainstein.“ The flat mirrors are treated as the walls of a two-di- 
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Fic. 3. Curves showing the amplitudes and phases of the field distributions using different 
values of N, for: 

(a) the lowest even (symmetric) mode; 

(b) the lowest odd (antisymmetric) mode. 
(After Fox and Li®.) 
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mensional waveguide, open at both ends (Fig. 4). A travelling mode very near 
cut-off consists of a plane wave travelling almost perpendicular to the walls 
and reflected back and forth by the walls. When the wave arrives at an open 
end, it is almost entirely reflected back into the same mode travelling in the 
opposite direction. In this way the condition for axial resonance of the Fabry— 
Pérot resonator becomes a condition for lateral resonance. By applying the 
theory of open-end waveguides, Wainstein was able to derive approximate 
expressions for the field distributions, the losses and resonant frequencies. The 
overall agreement with the results of Fox and Li is very good. The merit of 
Wainstein’s theory is that one can obtain expressions for the quantities of 
interest in closed form. 


Open~end waveguide (after Wainstein) 


‘ / hi\ # ‘ / 

\ ! \/ \] 
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Fic. 4, Following Wainstein’s approach, the mirrors are treated as the walls of a 2-d 
waveguide. 


It is of practical interest to consider the case where the medium between the 
mirrors is non-linear (saturable gain). This case can be dealt with by assuming 
the non-linear medium to be concentrated in two infinitely thin slabs close to 
the mirrors. Equation (11) can be suitably modified to solve this problem“ ~ +”? 
and it turns out that the presence of the non-linear medium alters the modal 
phase distributions, without appreciably affecting the amplitude distributions. 

The iterative method is especially suitable for the determination of the 
lowest-loss even and odd modes. Determination of higher order modes, while 
not impossible, is rather cumbersome, and to make such a determination easier, 
Fox and Li”) and Wells!» have elaborated other numerical techniques. The 
principle of the Fox and Li method, called “resonance excitation” is as follows. 
A resonator is illuminated with a fixed frequency through one of the end mir- 
rors and the length dis varied until resonance is reached for a prescribed mode. 
The field distribution across the opposite mirror turns out to contain predom- 
inantly that mode. If a number of identical resonators are arranged in series 
and suitably separated by isolators, the field distribution across the output 
mirror virtually coincides with the prescribed mode. In practice, a sequence of 
three resonators is sufficient for determining a mode. 
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Alternative approaches to the theory of the Fabry—Pérot resonator are 
based on the use of variational techniques either in coordinate or momentum 
space.‘!+~ 18) These approaches must be applied with some caution, otherwise 
they may lead to inaccurate results. When correctly applied, they become very 
laborious and require the solution of a system of many homogeneous linear 
equations and the associated determinantal equation. Risken,“® however, pro- 
duced an accurate solution quite simply. 

Another interesting approach is to consider a mode of an open resonator as 
a free oscillation of the system.2°~ 2*) One looks for the conditions (a complex 
frequency) under which an outgoing field can exist in the absence of an in- 
coming field. This approach can give results in quite good agreement with 
those found by Fox and Li. 

Some computations have been made for the case when the plane end mir- 
rors are not perfectly parallel, but are slightly tilted. °* 1© It turns out that the 
power losses of the modes increase very rapidly with the tilt angle, especially 
for large values of N. Even a very small tilt, corresponding to departure from 
parallelism of a fraction of a wavelength at the edge, can give rise to sub- 
stantial loss. This means that the alignment of the plane-parallel resonator is 
very critical. Moreover, the lowest order modes tend to become degenerate in 
loss. They will be present at the same time in the laser and will give rise to beats 
at the detector. 

Experimental verification of the behaviour of a Fabry—Pérot resonator is 
very difficult to carry out at optical frequencies, mainly because of the tremen- 
dous accuracy with which the resonator must be built and adjusted in order to 
obtain significant results. However, a very accurate experiment can be carried 
out with microwaves. ‘°° ?® If the mirrors are flat to within 1/300, the mode 
distribution coincides with that described by Fox and Li. 


3. THE CONFOCAL RESONATOR 


A typical confocal resonator is formed by two spherical convex mirrors with a 
common focus. There is, however, a large family of systems which behave 
optically in the same way as the confocal resonator.” 2®) We will first outline 
the general properties of this family. Let two plane mirrors M,, M, (Fig. 5) be 
placed at the focal planes, 2,, 2, of an ideal (free from aberrations) optical 
system S. The light beams in both directions will be assumed to be limited 
solely by the finite sizes of the mirrors. 

A given field distribution u,(x,, y,) across 2, will give rise by inverse inter- 
ference to a set of plane waves, each one specified by the first two direction 
cosines «, 6 of its propagation vector. The complex amplitude of a wave at the 
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origin O, will be denoted by A(«, B) da df and can be evaluated by the Fourier 
transform‘??? 


1 
A(a,f)= [uC ydexpl-ik(axr+ By d]dxi dy. 1) 


X44 h Xe 
| { 
i [ 
| | 
| 
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Fic. 5. Representation of a confocal resonator system. =, and 22 are the focal planes 
of the optical system S. 


System S will focus each plane wave A(a, 8) onto a point P, of £,. Since S 
has been assumed to be perfect, it will obey the sine condition.°™ Thus the co- 
ordinates of P, are 


x2 =f, yo = SB, (18) 


where f represents the focal length of S. 

It can be shown ‘°° that the energy carried by the wave is equal to 
Ci?|A|? da dB, where C is a constant of proportionality. If u. is the ampli- 
tude of the field across Z,, then the energy carried by the wave is equal to 
Clu.|?dx, dy,. From eqn (18) the area illuminated, dx,dy., is equal to 
f? da dB. Hence |u.| = A|A|/f. To find the phase, we note that the optical path 
O,BB’P, of any wave A(«, f) is identical to the optical path O,AA’O, of the 
axial wave; this optical path will be denoted by d. Moreover, we must take 
into account the effect of anomalous propagation, °°”) by which a spherical 
wave arrives at its focus with a phase shift of — 2/2. Hence we can write 


usl292) = — APE 40a, f) (19) 


If M, and M, are equal and we require that u, be proportional to u,, then we 
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find the following integral equation for the field distribution u on wny one of 
the mirrors; 
ae | u(X1,y1) exp | Fem +y.93)| dx,dy, 
St M1 tf 
(20) 

We will assume the contour of the mirrors to be symmetrical about the x 
and y axes. It is then an easy matter to prove that the eigenfunctions of (20) 
can then be split into a set of modes symmetrical about the origin and a set of 
modes antisymmetrical about the origin. The integral (20) turns out to be real 
for a symmetric mode and pure imaginary for an antisymmetric mode. Hence 
we find that the total phase shift for a round trip in the interferometer is 
2kd—x for a symmetric mode and 2kd—x+7 for an antisymmetric mode. 
Consequently, symmetric modes will resonate for 


ou(x2, 2) = 


A A 
= = 21 
aaa eG (21) 
and antisymmetric modes for | 
d= I> ; (22) 


where q is an integer. 

The modes turn out to be highly degenerate, and can only be discriminantly 
selected if they have different diffraction losses. 

Solutions of eqn (20), in particular cases, have been discussed by Goubau 
and Schwering¢” and by Boyd and Gordon.©” 

It is convenient to include the factor exp [ —i(kd— /2)] in o, to introduce 
the Fresnel number N = a?/fA and to introduce reduced coordinates analog- 
ous to those of the previous section. Equation (20) can then be written in 
standard form 


ou(es,N2) = it u(E,,1) exp [—27i(E 62 -+0i2)] dé,dm. (23) 


In the case of rectangular mirrors the variables can be separated and eqn 
(23) splits into two Fredholm equations of the second kind with symmetric 
kernels, each equation being valid for an infinite strip mirror. 

The equation for the € coordinate becomes 


+J/N 
em tn(Es) = | tals) exp (2m Ea) AE (24) 


For obvious reasons, the u,, are called the eigenfunctions of the finite Fourier 
transform. They turn out to be angular prolate spheroidal functions.?* The 
use of these functions in optical problems has been extensively discussed in 
recent years,3*~ 45) 
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When the mirrors are circular, one can introduce polar coordinates and in 
place of (24) one finds a finite Hankel transform. The eigenfunctions are then 
hyperspheroidal functions.°” +” 

In both cases it is found that, at least for the first eigenmodes, the “spot 
size” (i.e. the size of the region illuminated on the end mirror) is very small 
compared with the size of the mirror. The intensity falls off exponentially from 
the centre and is virtually negligible at the rim. This explains why the losses 
are extremely small, orders of magnitude smaller than in the Fabry—Pérot case. 

Since the finite size of the mirror has an almost negligible influence on the 
mode configuration, it can be disregarded in a simplified theory. One then 
finds Hermite-Gaussian eigenfunctions for the infinite strip case°?) and 
Laguerre—Gaussian eigenfunctions for the circular case.@ This will be dis- 
cussed later. 


Fic. 6. Confocal system of mirrors. 


We find first of all that the system S can be replaced by two lenses, each one 
very close to one of the mirrors; then each system, mirror-plus-lens, can be 
replaced by a spherical mirror as shown in Fig. 6. Each mirror has its centre 
of curvature on the other mirror and the system is confocal. 


4. BEAM WAVEGUIDES AND NON-CONFOCAL RESONATORS 


Let us start with a set of cylindrical coordinates r, ¢, z (Fig. 7) and transform 
these to spheroidal coordinates €, ¢, 0, using the following relations 


r=c/(1+é7)siné, @=¢, z= cécos8, (25) 


where c represents a constant length. 
The surfaces € = constant represent a set of confocal oblate spheroids, while 
the surfaces 8 = constant represent a set of confocal single-sheet hyperboloids. 
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The geometrical foci FF’ (to be distinguished from the “‘optical’’ foci) are 
separated by a distance 2c. The major and minor semi-axes of each ellipse are 
given by a = c(+¢7)*, b = cé; from which the radius of curvature of the 
ellipse at the intersection with the z axis is found to be R = a?/b = c(1+4 E/E. 
The major and minor semi-axes of each hyperbola are given by a = ¢ sin 8, 
b = c cos 9, where @ is the angle between the asymptotes and the z axis. 


Fic. 7. Transformation of cylindrical coordinates to spheroidal coordinates, Surfaces 
of constant ¢ represent a set of confocal oblate spheroids and surfaces of constant @ repre- 
sent a set of confocal hyperboloids. 


We can write the wave equation in spheroidal coordinates (25). Wainstein 
showed? that it can be solved approximately when the following conditions 
are satisfied ; 


ke>1, — sin?@ <1. (26) 
These conditions hold if: 
(1) the distance FF’ is much greater than the wavelength and 
(2) propagation is confined within a small solid angle about the axis. 


In real systems these conditions are usually very well satisfied. 
We introduce a new variable 


t= ./(2ke) 0. (27) 


A solution of the wave equation with separated variables can then be written 
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in the form 
i T, p) = (1 +&?)-4 exp {ilkcé (2n +m+ 1) arctan Ey} 


x cos (md) t™L,” (=) exp (-4), (28) 


where L,, denotes an associated Laguerre polynomial. This expression clearly 
represents a travelling wave, whose wave surfaces coincide with the spheroids 
& = constant. 

The last factor of eqn (28) shows that the field decreases exponentially 
away from the axis. We can define a “‘spot size” s, which represents a region 
about the axis where the intensity of the radiation is non-negligible. 


s= E ae] (29) 


From this definition it is seen that propagation is virtually confined within a 
single-sheet hyperboloid. Although this hyperboloid represents an ideal case, 
it does show that the system acts like a waveguide. For this reason, it is 
customary to speak of a “beam waveguide”. A wave described by eqn (28) 
will be called a “‘beam”’ for short. 

If one of the spheroidal surfaces of Fig. 7 is replaced by the surface of a 
mirror, the beam will be reflected at this surface and will exactly retrace its 
steps, taking on the same configuration at all positions. If two spheroidal 
surfaces are replaced by mirrors, we obtain a resonator. Because the spot size 
is so small it is permissible for the mirrors 


(1) to be spherical instead of spheroidal, and 
(2) to have a finite size. 


We thus have a general resonator with spherical mirrors. 

The confocal system is formed by the two mirrors € = 1. In the language of 
geometrical optics, the hyperboloid of Fig. 7 would be called a “caustic” 
surface. The existence of caustic surfaces has been proved for resonators of 
different geometries,“°~ 5" for example, toroidal-ring resonators. The exist- 
ence or non-existence of a caustic surface can be directly related to the stability 
or instability of the mode. 

The condition for resonance is found from (28) by requiring that the phase 
shift in going from the first to the second mirror should be a multiple of 7. All 
modes having the same value of 2n +m have the same resonance (degeneracy). 

The cartesian coordinate x can be used in eqn (25) in place of the coordin- 
ate r. A solution of the wave equation, independent of y, can then be found. 
This is the solution for infinite strip mirrors, which turns out to be a Hermite- 
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Gaussian expression instead of a Laguerre-Gaussian expression, as was 
mentioned in the previous section. 

Products of two infinite strip modes (one for x and one for y) can be used 
in place of the Laguerre-Gaussian modes. As Jong as the mirrors are very 
large in comparison with s and there is no perturbation present within the 
resonator, these two sets of modes are perfectly equivalent (degenerate). The 
finite size of the mirrors can act as a perturbation which removes this degen- 
eracy. If the mirrors are circular, the modes are Laguerre~Gaussian functions, 
while for rectangular mirrors the modes are Hermite—Gaussian functions. In 
a conventional gas laser the main perturbation is usually due to the Brewster 
angle windows. This leads to separation of the x and y variables i.e. to 
Hermite—Gaussian modes. Mode patterns of the type shown in Fig. 8 are 
obtained. 


aoe 


TEMg43 


Fic. 8. Example of a mode pattern in a gas laser with Brewster angle windows. 
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Equation (28) describes the field inside a resonator when the diffraction due 
to the finite size of the mirrors is neglected. The equation may also be used to 
calculate the losses, but it has been proved by Wainstein °* ** that this leads 
to an over estimation of the losses. Hence we conclude that diffraction at the 
edges of the mirrors alters the modal field distributions in such a way as to 
reduce the losses. 

It is clear that, apart from the dependence on @¢, a beam is completely 
specified by the location of its centre O (Fig. 7) and of one of its geometrical 
foci, say F. Any other element can be derived by a graphical construction, 
suggested by Deschamps and Mast.“°* Let the beam be specified by the centre 
O and the focus F (Fig. 9) separated by a distance c. We want to find the rele- 
vant parameters of the wave surface through the point M on the axis. 


Pp 


Fic. 9. Graphical construction suggested by Deschamps and Mast for finding the 
parameters of a beam. 


If OM = cé, the wave surface has the equation = OM/c. Let us put 


c c 
tanu = é, R= ———-, = ae 
sin u COS u cos* u 


(30) 


Therefore u = MFO. Using the value of u we can immediately calculate the 
phase shift in expression (28): 


phase shift = (Qn+m+1)u. (31) 


The quantity R is easily found to be equal to c(1+€é7)/é, i.e. to the radius of 
curvature of the wave surface at M. It is seen by inspection that the centre of 
curvature C of the wave surface can be found by drawing the perpendicular to 
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MF through F. As for the quantity ©, it turns out that its value c(1+€7) is 
proportional to the square of the spot size. Conventionally, & will be called 
the “beam area”. The beam area is found by inspection to be equal to MP. 

By this graphical construction or by other types of equivalent transforma- 
tions°~°”) one can easily solve a number of practical problems. 


Stable 


Fic. 10. Arrangement of a resonator with spherical mirrors corresponding to a stable 
configuration. 


First of all, let us determine the parameters of the beam which travels back 
and forth within a resonator with spherical mirrors. Let C, and C, (Fig. 10) 
represent the centres of curvature of the mirrors. They must coincide with the 
centres of curvature of the wave surfaces of the beam at M, and M, respec- 
tively. Obviously, one has to draw the two circles shown in the figure; their 
intersection F determines the geometrical focus of the beam and O its centre. 
When a point of intersection can be found in this way, the configuration is 
“stable”. An “unstable” configuration is shown in Fig. 11. All pairs of mirrors 
which give rise to the same beam are “equivalent’’.. 


Unstabie 


Fic, 11. Arrangement corresponding to an unstable configuration 
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In the confocal configuration, shown in Fig. 12, the two circles coincide and 
any point on the circumference can be labelled as F. However, it is clear that 
the symmetrical location of F, shown in the figure, is the one which gives 
minimum beam area on the mirrors. This situation corresponds to minimum 
loss, and takes place in practice. 


Fic. 12. Configuration in a confocal resonator corresponding to minimum loss. 


The graphical construction is useful for describing the passage of a beam 
through an optical system, say a thin lens (Fig. 13). Let the incident beam be 
specified by its characteristic points O and F. We want to find the character- 
istic points O’, F’ of the image beam. First, we find the centre of curvature C 
of the wave incident upon the lens. Obviously, the image of C is at C’ accord- 
ing to the usual thin lens formula. Next, we find the point P which determines 
the beam area on the plane of the lens; P’ must coincide with P. It is now an 
easy matter to complete the construction and find F’ and O’. 


Fic. 13. Graphical construction for describing the passage of a beam through a thin lens. 
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5. PERIODIC SYSTEMS AND STABILITY 


Since the early days of open resonators, it has been noted that the theory of an 
open resonator is in many respects equivalent to that of a travelling-wave 
system in a periodic structure. 

For instance, we can start with the resonator of Fig. 5 and “unfold” it, by 
substituting real spaces for the virtual spaces created by the mirrors M,, M>. 
We thus arrive at the travelling-wave system depicted in Fig. 14, where the 
mirrors are replaced by periodic aperture stops My, M,, Mo... . 


Fic. 14. Traveiling-wave system equivalent to a confocal resonator. 


Each pair of consecutive lenses, say So, S;, is equivalent to an afocal system 
of unit magnification. The rear focus Fy’ of Sy coincides with the front focus F i 
of S,. The separation d between the rear principal plane of S, and the front 
principal plane of S, is equal to twice the focus length. In a chain of such sys- 
tems each ray repeats its path periodically with respect to the lenses. 

If the spacing d is now changed from twice the focal length, the pair of 
lenses So, S, is no longer afocal. However, it may be possible for a set of n 
consecutive subsystems Sp, Sj, ..., S,-, to form an afocal system of unit 
magnification. The following analysis, due to Burch®*®) shows that this is 
possible. 

Let the spacing of the identical systems S,, whose focal length will be taken 
as unity, be such that F’,_, lies in front of F, by a distance «. We take an 
object point located on the axis at a distance x, to the left of the front focus 
F of So. The image point will be located at a distance x,’ = 1 /Xo to the right 
of Fo’ and consequently at a distance x, = a— 1/x9 to the left of F,. We thus 
obtain the recurrent relation 


x, =a-—, Xz =a-— = Seas (32) 
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After traversing n subsystems, the displacement x, is given by the continued 
fraction 


1 
Mi 1 


1 
,= (33) 
Xo 
where « appears n times. 
For the system to be afocal, we require x) = x, = 00 and so obtain an 
equation for « of degree (n— 1), whose solutions can be shown to be given by 


“% = 20s (= r), (34) 


where r = 1, 2, ...,2—1. There are therefore n—1 distinct solutions. However, 
it will be noted that among the solutions for n there are also solutions for the 
submultiples of n. The lateral magnification, which equals (—1)"/(x9x,...X_—1) 
turns out to be (— 1)’. 

If we now assign a value to «, we must distinguish between two cases. If 
|x| > 2, there are no values of r and n which satisfy eqn (34). Indeed, as is well 
known in optical design and in accelerator theory, if the spacing between the 
foci exceeds twice the focal length, the system is unstable®®®® and any ray 
traversing the system gradually moves away from the axis and will eventually 
fall outside the system. If |x| < 2, it may be possible to find a pair of integers 
r,n which satisfy eqn (34) and then we have an n-fold afocal system. However, 
even if such a pair of numbers does not exist, the fraction r/n can be chosen as 
close as desired to any real number. We can conclude that any physical sys- 
tem with || < 2 is practically n-fold afocal, provided that n is made sufficiently 
large. 

Let us now consider an open resonator with spherical mirrors M,, M2, 
having radii of curvature R,, R, (positive for concave mirrors, negative for 
convex mirrors) and spacing d. This system is equivalent to a succession of 
thin lenses having alternatively a focal length f,; = R,/2, and f, = R,/2. A 
subsystem S,, will be formed by a pair of successive lenses. Since the focal 
length of the subsystem will be taken to be unity we find 


1 1 d 


ak fe on 
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The distances /, /' from the front focus to the first lens and from the second 
lens to the rear focus, are given by 


d d 
ey ee 36 
r7 r7 oe 


We will now introduce two convenient parameters 


ad aay 92=1- (37) 
or d 
91 = 1-55 92 = 1—5F- (38) 
From these definitions and from (35) we find 
d = 29,+292.—49; 92. (39) 


The distance « (= d — 1 — I’) from the rear focus of one subsystem to the front 
focus of the next one becomes 


a= 2— 49,92, (40) 
For an n-fold system we have therefore from (34) 


rut 


int ls): 4 
9192 = sin? (=) (a1) 


For stability, the product g, g, must be positive and less than 1. We can plot 
a stability diagram (Fig. 15) as was done by Boyd and Kogelnik,“® and by 
Fox and Li.2@” 

The coordinate axes represent g,; and g,. The shaded regions correspond to 
high loss or unstable systems, while the unshaded regions, lying between the 
axes and the hyperbola g, g, = 1 represent low loss or stable systems. Points 
representing mirrors with equal radii lie on the 45° line g, = g,. The origin 
corresponds to identical mirrors spaced confocally, the point (1, 1) corres- 
ponds to plane parallel mirrors and the point (— 1, — 1) corresponds to con- 
centric mirrors. Confocal systems lie on the dashed hyperbola 2g, g. — g,; — 
92 = 0 and it is seen that they are all unstable, except for the system with 
identical mirrors. However, it must be emphasised that the confocal resonator 
with identical mirrors lies on the boundary between stable and unstable 
regions. Consequently, even a small departure from ideal construction of such 
a resonator may cause substantial loss. 
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The low-loss regions of Fig. 15 represent three sets of possible geometries: 


(1) both mirrors are concave and the centre of curvature of each mirror lies 
beyond the other mirror; 

(2) both mirrors are concave and the centre of curvature of each mirror lies 
between the other mirror and the centre of curvature of the latter; 

(3) one mirror is convex and the other is concave, and the centre of curva- 
ture of the concave mirror lies between the convex mirror and the centre of 
curvature of the latter. 


WY, 4,='-\d/R,) 


oo i : ys 
— Pe 


(0;0)/ b 29, = 1-(d/R,) 


Fic. 15. Stability diagram for various mirror configurations. The shaded regions corre- 
spond to high-loss (unstable) systems, and the unshaded regions to stable systems. (After 
Fox and Li.@*) 


\ 


Of course, the transition between low-loss and high-loss regions is not as 
sharp as is suggested by geometrical optics. It is a gradual transition which 
becomes more and more abrupt as the Fresnel number N increases.2“) In any 
case, it turns out that mirrors with equal radii confocally spaced give rise to the 
lowest loss. The loss decreases with increase in the Fresnel number, in the low- 
loss region, but fluctuates without eventually decreasing, in the high-loss 
region. The total optical path in an n-fold system is nd, so that the conditions 
(21), (22) for resonance become 

A 


A 
ne = qe as (42) 


where ¢ = | for symmetric modes and ¢ = 0 for antisymmetric modes. 
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However, not all the resonances (42) may be present in a real system due to 
the fact that the system is folded upon itself. Some eigenfunctions may be 
identically zero, due to destructive interference in successive round trips. For 
instance, Boyd and Kogelnik®® give the following formula for the case of 
rectangular mirrors 


2d 1 
—=qt — (s-+8) arc cos \/(91 92), (43) 


A 
where q and s are positive integers and ¢ has the same meaning as before. By 
eliminating g, gz using eqn (41), it is easy to show that all the resonances (43) 
are included in the general expression (42), whereas the reverse is not true. Of 
course (43) could be derived from eqn (28). 


6. ROoF-MIRROR RESONATORS 


Stable spherical-mirror resonators, of which the plane parallel Fabry—Pérot 
and its relatives°!~©* are particular cases, have been most widely investi- 
gated. However, there are other stable geometries which may have some spec- 
ial useful property, such as enhanced mode selectivity,°°~°© large mode 
volume”? or a highly concentrated field with low loss.©® 

The use of a diffraction grating as a mirror of a resonator has recently found 
an interesting application in the so called ‘‘orotron’’. This device is very useful 
for the generation of millimeter waves.?~ 7) 

Sometimes unstable spherical-mirror resonators can also find useful appli- 
cations.’*~75) The spill-over radiation can be conveniently used for the 
extraction of energy (coupling to external space or to other devices). Unstable 
geometries may also be used to provide large mode volume and good Iateral- 
mode selection. 

Another interesting geometry first analysed by the author is that of the 
flat-roof resonator.’®~7” In this type of resonator (Fig. 16), both terminal 
mirrors are roofs with extremely small angles (107 © to 107 radians) which in 
practice appear to be very nearly flat. 


Pa 


Fic. 16. Roof-mirror resonator. (After Toraldo di Francia.” 
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In order to investigate the modes of the flat-roof resonator, it is expedient to 
start with a closed diamond cavity GBHE (Fig. 17) of very small angle «a. 
Denoting the distance between the two vertices B and E by d, we will put 
d = qoA/2, where qo represents a very large number, not necessarily an integer. 


Diamond cavity 


Fic. 17. Showing how a flat roof resonator can be considered as a section of a diamond 
cavity. The angle « is between 10~° and 10-3 radians. (After Toraldo di Francia.) 


The left hand side of the cavity will be described in terms of cylindrical co- 
ordinates r, z, @ with the z axis through G, while the right hand side will be de- 
scribed in terms of a symmetrical system of coordinates with the z axis through 
H. In both halves, solutions of Maxwell’s equations with the electric (or 
magnetic) field parallel to z are given by 


E = J, (kr) cos (v@). (44) 


with the boundary condition 
mT 
v= q ae ’ (45) 


where g is an integer. We are interested in the case where g ~ qo with q <Q. 
The order of the Bessel function v then turns out to have a very large value. 
In the region of interest, i.e. near the centre of the cavity, kr also has a very 
large value, of the same order of magnitude as v. We can therefore use the 
asymptotic expressions for Bessel functions of large order and argument.{7®) 
It is expedient to consider the “cut-off” plane AD (and correspondingly CF) 
for which AD = gA/2 and to introduce the abscissa x with respect to the cut-off 
plane, measured positively from the centre of the cavity. We can write 


cos [5 (240 )* * | 
13 Vee 4 


x( q\x| )’ 
aa 


forx <0 (46) 
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and ms | 8x ( ax? )'] 
’ 3 Ge 
J (kr) = for x > 0. (47) 


Equation (46) represents a standing wave. We must use the fact that « is very 
small and ignore the curvature of the nodal and antinodal surfaces. It is then 
an easy matter to match the fields in the left and right halves of the cavity. The 
resonance condition for the mth mode then becomes 


ee 


-1-2 -0-8 -0-4 


-!16 <-I:2 


intensity in arbitrary units 


oat 
-—1-6 -1:2 -0-8 —-0O-4 1e} 0-4 0-8 1-2 1-6 


4/3 
Lateral coordinate in units I-(2(q/a@) A 


Fic. 18. Intensity distributions for several modes in a flat roof resonator. (After Toraldo di 
Francia,7”) 
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Equation (47) shows that outside the cut-off planes the tield falls off exponenti- 
ally. It is therefore permissible to limit the walls of the cavity to the portions 
ABC, DEF, thus obtaining a flat-roof resonator. 

Figure 18 shows the intensity distribution of a few modes, calculated using 
the above formulae. 

This type of resonator and its optically equivalent beam waveguide, con- 
sisting of a sequence of special narrow-angle prisms have been investigated 
by Kahn” in terms of geometrical optics. Kahn has calculated the transverse 
field distribution using simple electromagnetic theory. An appropriate defini- 
tion of the spot size is found to be in agreement with that of Toraldo di 
Francia.”” 

The results of the approximate theoretical approach (valid for very small 
values of «), have been verified by Checcacci, Consortini and Scheggi‘®®~ ®” 
by numerical computation and by experiment in the X band. 

The numerical solution of the integral equation for the flat-roof was found 
by the Fox and Li method of iteration. The losses were evaluated as functions 
of the roof angle «. The investigation was not limited to small values of «. 
Negative values corresponding to convex flat-roof resonators were also consi- 
dered. Figure 19 shows the power losses per transit plotted versus « for N = 
2-22. In the positive region of « the losses for the zeroth order mode have 
minima and maxima. In the region of the first maximum, the second order 


2a=45A d=288a 


(N=2-22) 


anno ecie 


Power loss per transit 


=n 
aa 


a (degrees) 


Fic. 19. Plot of the power loss per transit versus the angle « in flat roof resonators. (After 
Checcacci et al.) 
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mode losses have a broad minimum. Thus there is a region which is unstable 
for the fundamental mode, but in which a higher order mode has lower losses. 

The existence of minimum losses corresponding to particular values of « 
had been found by Voitovich®* for a resonator terminated by conical re- 
flectors. 


b =. xo 
(N= 2-22) eso 
©) 1 
4 1 
a= -l5 - a=l5 
O ’ 0 1 
1 4 
= a=— 30’ a= 30' 
2 
Q@ 
€ 
10) 1 16) 4 
1 1 
a= 4)’ a=4l 
ce) j 0 1 


Lateral coordinate 


Fic. 20. Zeroth order mode patterns for positive and negative values of «. (After Checcacci 
et al, 8) 


For negative values of «, the losses increase with |x| as one would expect due 
to the geometrical instability .There is also a region here where an inversion 
occurs between the fundamental and higher order modes. In passing from a 
convergent to a divergent resonator the mode shapes change noticeably as can 
be seen in Fig. 20 where zeroth order mode patterns are shown for some posi- 
tive and negative values of «. It can be observed that the convex flat-roof re- 
sonator has a larger mode volume than the geometrically stable concave 
resonator of the same type. The fact that the field intensity tends to shift its 
maxima towards the edges of the mirrors by increasing |a| can be very useful 
in resonator coupling. 

Experimental tests were carried out in the X band using a roof mirror of 
variable angle. Figure 21 shows a comparison between the approximate 
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measured 
meee diamond cavity approach 
—-—+— numerical computations 
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Fic. 21. Comparing the calculated field distributions, in the zeroth and second order 
modes, with experimental values. Mirror aperture 2a = 452 and a = 1°16’. 
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theory, the iterative computation and the experimental measurements for the 
field distribution (intensity) of the first two even (symmetric) modes. 

The problem of modes and losses in a 90°-roof resonator has been tackled by 
Checcacci and Scheggi.‘** 85) They carried out some experimental tests on an 
X-band model consisting of a plane mirror and a right angle roof mirror. In 
order to investigate the effect of the departure of the roof angle from 90°, this 
was varied over a small range around 90°. 


14 
[+2 ? 
vt 
a aes 
2 
a 0-8 
Ww 
8 
: 0:6 
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@) r 2° 3° 4 
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Fic. 22. Comparing the power loss of the (90—.«/2)° roof resonator with that of the flat 
roof resonator. (After Checcacci and Scheggi.“**) 


In the geometrical optics approximation, the behaviour of a (90—«/2)° roof 
resonator may be predicted from that of an equivalent flat-roof, at least for 
small values of a. For « = 0 the flat-roof becomes a plane mirror, hence one 
can expect that the 90°-roof mirror will also be equivalent to a plane mirror. 
The behaviour of the energy loss per transit as derived from the measured Q’s 
is plotted versus « in Fig. 22 along with the analogous curve computed for the 
equivalent flat-roof mirror. For small values of « (<2°) the two curves show 
vaguely similar behaviour, but the similarity disappears for larger values of « 
and in particular the experimental curve does not follow the high loss region 
of the flat-roof resonator. 
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Review of Nonlinear Optical Phenomena in 
Condensed Matter 


N. BLOEMBERGEN 


1. INTRODUCTION 


This chapter will review a variety of nonlinear optical phenomena which have 
become accessible to experimental investigation in the past decade due to the 
extremely high light intensities obtainable in pulsed laser beams. Since con- 
densed systems with a large number of degrees of freedom can exhibit a com- 
plex array of different phenomena, many simplifying assumptions will be 
made in the theory. From the outset the electromagnetic fields will be treated 
classically. The chapters by Glauber, Haken and Louisell in this volume show 
that a complete quantum mechanical treatment of some of the simpler non- 
linear phenomena, such as parametric generation, already exists.“ The semi- 
classical approach permits a quicker, unified survey and leads to good agree- 
ment with practically all experimental situations, provided spontaneous 
emission noise fields are added in an ad hoc manner. 

The general Hamiltonian may be divided into the vacuum field, the material — 
Hamiltonian and a term representing the interaction between electromagnetic 
radiation and matter. In addition, the interactions with a thermal reservoir 
with a large number of degrees of freedom will lead to a phenomenological 
damping term, 

H = Him +H mar +H int + damping (1) 


In general # y contains all the vacuum oscillators, and #%,,,, in diagonal- 
ized form contains all the energy levels of the condensed matter under consid- 
eration. In many situations it is possible to restrict the attention to a limited 
number of electromagnetic oscillators and/or a limited number of material 
energy levels. The interaction Hamiltonian may often by written, e.g. for 
bound electrons interacting with light waves or electric fields of lower fre- 
quency, in the electric dipole approximation, 


Him = — 24%. E= — Din-E, (2) 
t t 
where y; is the electric dipole moment operator of the ith particle. 


For a statistical ensemble of particles, the density matrix p describes the 
, 355 
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quantum mechanical behaviour of the material system. In the absence of 
damping it obeys the operator equation 


ihp = (#o, p] — E.[h, pl. (3) 
The macroscopic electric polarization of the medium is given by 
P = NTr (pp), (4) 


where N is the number of identical particles per unit volume. The electro- 
magnetic field obeys the classical Maxwell equations. The wave equation for 
the electric field then takes the form 
1 OE 4n 0?P 

WAMAUE gee 8 eget ©) 
The set of equations (3-5) constitute the starting point of the semiclassical 
treatment of nonlinear optics. Only the simplest case allows solutions in a 
closed form. As an example, the propagation of a single quasi-monochromatic 
pulsed wave travelling through a medium containing atomic systems with only 
two energy levels will be discussed in the next paragraph. For more compli- 
cated situations one must resort to perturbation procedures. 


2. LIGHT PULSE PROPAGATION THROUGH A Two LEVEL SYSTEM 


When a laser pulse, e.g. from a ruby laser, is passed through a second ruby 
crystal, it will normally be absorbed, if the initial population of the two levels 
of the Cr?* ion involved in the laser transition is normal. If the second ruby 
laser crystal is also optically pumped to yield an inverted population, the laser 
pulse will be amplified. For very intense light pulses there may, however, be 
no net gain or loss of light energy. This effect is called self-induced transpar- 
ency, and was first demonstrated and discussed in detail by McCall and 
Hahn. An excellent detailed account, consolidating the efforts of many 
other authors, has recently been given by Icsevgi and Lamb.) The case of 
one electromagnetic mode interacting with a two level system provides the 
simplest physically meaningful situation of nonlinear response. 

The electric field is assumed to be a quasi-monochromatic travelling wave, 
linearly polarized in the x-direction, with a slowly varying amplitude and phase 


E,, = A(z, t) exp (ikz—iwt) = &(z, t) exp [ip(z, t)] exp (ikz— it). (6) 


The two energy levels are denoted by |a> and |b», with energies —4hw, and 
+4hw, respectively. The only nonvanishing matrix elements of the electric 
dipole moment operator of interest are 


(Hx)ab = (Hx) ba = Hu. 
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The frequency w is chosen close to the resonant frequency @y with A = wy — o. 
The wave number k = we~‘e'/? includes the effect of the linear dielectric 
constant of the bulk crystal, exclusive of the two Cr?* levels. It may also 
include the effect of other light losses by giving the linear dielectric constant 
an imaginary part. 

It is well known that the equation of motion of a two by two density matrix 
has a one to one correspondence with the motion of a real classical magnet- 
ization vector. Introduce three real components, (X, Y, Z) which are slowly 
varying quantities in the interaction representation, such that; 


X —iY = 29,, exp (—ikz+iot) 
Z= Paa — Poo 
l= Paa + Po Par = Pou 


In terms of these quantities the operator equation of motion (3) may be written 
as 
; 1 
X(z, t, A) = —AY(@,t.A)-——X, (7) 
2 


Y(z,t, A) = AX(z,t, A) +h7! p&(z, t) Z(z,t, A) — —Y, (8) 
2 
Z(z,t, A) = —h7! p&(z,t) Y(z,t, A) ~ (ZZ). (9) 
1 


The last terms on the right-hand side are phenomenological damping terms. 
In writing these equations, rapidly varying components in X, Y and Z have 
been ignored. These terms would give rise to harmonics of small amplitude, 
as discussed in the next section. The truncation of these counter-rotating 
terms permits us to restrict ourselves to one mode of the electromagnetic field. 
The equations of motion would be strictly correct for a circularly polarized 
wave and magnetic dipole matrix elements of the form (,),, = — i( Ly)ap- The 
electric polarization is given by 


P,(z, 1, A) = NuLX(z, t, A) — iY(z, t, A)] exp (ikz — iot). 


It is seen that X corresponds to the in-phase component and Yto the compon- 
ent of polarization which is 90° out of phase with the electric field. Substitu- 
tion of this polarization into the wave equation (5) leads to the complex 
amplitude equation, 


c 0A, 0A _ 2nNyw? 


ian gp re 


Here n is the linear index of refraction of the crystal, exclusive of the two level 
system. Second order derivatives in A have been ignored (slowly varying 


358 N. BLOEMBERGEN 


envelope approximation). Equating the real and imaginary parts on both sides 
and allowing for a distribution of resonant frequencies g(A) (inhomogeneous 
broadening), one finds 


c 06(z,t) _ 06 (z,t) 2nNuo ft” 
oer a ee { : Y(z,t, A) g(A)dA (10) 


and 


(£244) 2nN po 
n Oz ot 


+00 
= 5 | X(z, t, A) g(A) dA. (11) 
n -—@ 

The solutions of the set of five coupled partial differential equations with five 
real variables have been discussed by a large number of authors.‘ *) Icsevgi 
and Lamb® add a sixth equation, because p,,+ P55 is not assumed to be 
equal to unity. If the laser transition is between two excited levels, each of 
which may be populated independently by a pump mechanism and may decay 
(relax) independently to lower states, this is a useful extension. So is the gener- 
alization to moving particles, where the density matrix p also depends on the 
velocity. For the propagation characteristics of a travelling wave the resultant 
Doppler effect may be incorporated in the inhomogeneous broadening g(A). 

Here we shall restrict ourselves to a solution, in which the damping terms 
in eqns (7-9) may be ignored. This solution applies in the limit h~*p&T> > 1. 
For the case of exact resonance and no inhomogeneous broadening, g(A) = 
6(A) and A = 0, the equations have a solution with X = 0 and ¢ = 0. The 
remaining set of three coupled equations for &, Yand Z can be solved exactly, 
with these quantities being functions of the argument t—z/v. They correspond 
to the solution in which the envelope propagates without change of shape 
with a velocity v. If g(A) is an even function of A, one can still find solutions 
with @ = 0, while Y(A) is an even function of A and X(A) an odd function. 
Several workers“ have found periodic solutions in terms of Jacobian elliptic 
functions, whereas McCall and Hahn had shown earlier that an aperiodic 
pulse solution exists in the form 


&(z,t) = & sech [hp &y (t—z/v)]. 


At resonance, A = 0, the vector (X = 0), Y, Z is rotated through an angle 
t 
@(z, t) = | h-*p&(z,t’) dt’. 
—-e@ 


At the end of the pulse this angle is 2x and the material has returned to its 
initial state. At high power levels the pulse thus passes without attenuation or 
gain. The feature which distinguishes it experimentally from pulse propagation 
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through a saturated medium is that the velocity of propagation is changed, 


1 2nNwp?r? im g(A) dA 


D> —o» 1+(Aat)?’ 


n 
c e hen 
wheret 1 = h-'pé 
If the medium is initially absorbing, energy is taken up by the medium at the 
beginning of the pulse, and is returned to it at the end of the pulse. This has the 
effect of a retardation on the propagation of the envelope of constant shape. 
Another way of expressing this is to say that a fraction of the energy is stored 
in the medium and this part does not contribute to the energy flux, so that 
v x (EM+ material energy density) = (c/n) x EM energy density. This pulse 
delay has been observed experimentally by McCall and Hahn. The pulse from 
a ruby laser was passed through a second passive ruby crystal kept at liquid 
helium temperature to keep T, long. Forh~'&uT, > 1 transparency occurred. 

When the material is initially in an excited state, i.e. behaves as an active 
laser medium, the pulse shape would tend to propagate with a velocity larger 
than the velocity of light. This kind of effect has been observed in a Jaser pulse 
amplifier. There is no conflict with special relativity theory. There will never 
be a signal before a time t—zn/c, if there were no input before ¢ at z = 0. If 
there is an input, the leading edge is amplified and the shape of the pulse 
appears to move forward, although it is not a stationary or distortionless 
solution in the strict sense. The hyperbolic secant is of course not a physically 
realistic input, since it starts at — 00. 

Detailed analysis with the aid of computers shows how, in the presence of 
damping, a large input pulse of arbitrary shape, may break up into a number 
of 2 pulses. The only stable solutions are of course those in which the material 
eventually returns to a passive state. For an initially inverted population one 
would have the emission of 2, 32 etc. pulses. For further details references 
2-4 should be consulted. 

We next turn to the numerous situations, in which the damping of the matrix 
elements is sufficiently large, or where the frequencies of the applied fields are 
sufficiently far removed from atomic resonant frequencies, so that the equa- 
tion of motion of the density matrix may be solved by a perturbation pro- 
cedure. In this case one need not restrict oneself to a two level system. The 
response of the material can be described by a series of nonlinear suscepti- 
bilities of increasing order. 


3. LOWEST ORDER NONLINEAR SUSCEPTIBILITY 


When ji! pE| < |A+iT,~+|, ie. when the matrix elements of the interaction 
Hamiltonian measured in frequency units are small compared to the inverse 
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damping time or well removed from resonance, an iterative perturbation pro- 
cedure may be used to describe the response of the material to periodic 
perturbations. 

The response of the material to first or linear approximation is given by 


—Io py? = —1W,; pip Vy pip —ih-* [—p.E(a), P Teas 


where E(q) is the electric field Fourier component at frequency w; p° is the 
steady state or zero order density matrix, and usually contains only diagonal 
elements p,,° giving the fractional occupation of each state. To second approx- 
imation one finds, for example, elements varying at the harmonic frequency 
2a, 


— ie pip (2) = — iy, py? (20) — Ty pgp —ih-* [—w. E@), pO) (12) 
With the trace relation equation (4), one finds a harmonic polarization 
P;2@) = N Tr u;p(2@) = x; (—2@, @, @) E;(w) E,(@) exp (—2iwr). (13) 


The nonlinear susceptibility tensor describing a polarization quadratic in 
the electric field amplitudes is thus determined by this procedure. In the case 
of negligible absorption at both the fundamental and the second harmonic 
frequency and in the case where yu, has no diagonal elements (no permanent 
dipoles), one finds, for example. 

Xxx (— 2, Q, w) aaa » 3 DNC Hon Uy nn’ Uy,n'g Pgg(9) h-? 


x [(20—y,)~* (@-O,,)* + (20+ @,)~? (@+@y,4)' 
—(@+@py,)* (@— yg) "J (14) 


More generally the incident electromagnetic field may consist of a number 
of running waves 


E(r, t) = ¥°[A,(,) exp(ik, . r—iw,t) + A,*(,) exp (—ik, .r+io,t)]. 


The Cartesian component in the i-direction of a polarization at the sum 
frequency, proportional to an electric field at w, and an electric field at w, 
may then be written as, 
PI*(r, @,+@ 2) = D xij — @, —W2, W1, @2) 
x E;(@1) E,(@2) exp [i(k, +k,).r] (15) 
The real polarization is obtained by multiplying by the time factor 


exp{ —i(w, +.@,)t} and adding the complex conjugate. 
The reality of the physical polarization requires 


Xi — ©, — Wz, @1, W2) = X* ijn (@y+@2, —@1, —@9). 
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If the material is not dissipative, the nonlinear susceptibility should be invari- 
ant for time reversal, which is equivalent to replacing each frequency by its 
negative value. The result is that y;,, must be real. This follows of course also 
from the microscopic calculation in the absence of damping. The summation 
convention over the repeated indices j and k is understood in eqn (15). Each 
index runs separately over all three coordinates, x, y, and z. Because the order 
of the field components on the right hand side is of no importance, one has 
obviously 


Xije(—@1— 2, M4, W2) = Xinj( — @1 —@z, W, @). 


D is the degeneracy factor, describing the number of distinguishable permuta- 
tions of the frequencies; D = 2, if w, #4 w,. Eqn (15) may also be used to 
describe the second harmonic polarization with D = 1 and @, = @,. The 
same expression may also be used to describe the polarization at the difference 
frequency w,—@,, by simply taking one positive and one negative frequency 
Fourier component. 

Explicitly, we have 


P**(@, —@ ) = (D = 2) Xije(— 1, +2, @4, —@ 2) E;(@,) E,*(@2). (16) 


The rectification of light is also described by taking the two frequencies equal 
and opposite. A dc polarization is induced proportional to the intensity of the 
light, 


P,X*(0) =(D= 2) xijn(9, 1, —@ 4) E,@,) E,(~—@;) 
+(D = 2) Xin, @2,—@2) E,(@2) E,(—@2). (17) 


Note that the degeneracy factor remains two in this case, because positive and 
negative frequencies are distinguishable. 

The nonlinear susceptibility tensor is obviously a third rank tensor, con- 
necting three polar vectors. The explicit forms for the 32 crystal classes have 
been given by Butcher and Giordmaine®. All elements must of course vanish, 
if there is a centre of inversion. In the general case of the triclinic class 1 there 
are 27 independent elements, because the tensor need not be symmetric in any 
pair of indices, if all frequencies are different. For second harmonic generation 
the elements are obviously symmetric in the indices j and k. Therefore x; 
(—2m, @,@,) has the same symmetry properties as the piezoelectric tensor, 
which connects the polar vector P with the symmetric stress tensor. 

The tensor ¥;;, (—@4—@2, @,,@ ) has a part which is antisymmetric in the 
last pair of indices for w, #4 w, and this part is explicitly displayed in the 
tables of Giordmaine which also include textures, such as liquid crystals and 
isotropic liquids. Because of this antisymmetric part sum frequency generation 
is possible in the cubic class 432, which is not piezoelectric, and does not allow 
second harmonic generation. 
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Sum frequency generation is also possible in an isotropic fluid which does 
not possess a centre of inversion. This fluid must show natural optical activity. 
It must contain a non-racemic mixture of optically active molecules. Sum 
frequency generation has been demonstrated in a solution of /- or d- arabi- 
nose.‘ 

In an isotropic fluid the third rank tensor must of course reduce to a scalar 
in a simple vector relation. The sum frequency generation can be described in 
this case by 


P(@,+@2) = 2x2 {E(@1) A E(@,)}. 


Here x,,, may be considered as a pseudo-scalar property of the fluid. For the 
coherent radiation from this polarization to be observable, it is necessary that 
the light beams at w, and w, propagate at an angle to each other. 

In a non-dissipative medium a thermodynamic time averaged potential may 
be defined, from which the nonlinear polarization is derived by differentia- 
tion. This leads to the permutation symmetry relation, 


D’ Xijn(— Oy —@,@1, @2) = Xjin(O1, —@,—@ 2, W), 


which states that any pair of indices in the susceptibility tensor may be inter- 
changed, provided the corresponding frequencies are also permuted. D’ = 
1+6,,6,2 is another degeneracy factor. It is usually unity, but it equals two, 
if the frequencies w, and @, are equal and j = k. A nontrivial application of 
this relation is the prediction that the rectification of light according to eqn 
(17) is described by the same constant that describes the linear Kerr or Pockels 
effect. Both are derived from 


UNE x Xijx(0, @1,—@,) E,4-(0) E,(@,) E,*(@;). 


Note that 477;;,E ia: gives the part of the dielectric constant ¢,, which is linear 
in the applied dc field. This prediction has been confirmed experimentally. 

In the literature much attention has been paid to the Kleinman symmetry 
relation or Kleinman conjecture. Kleinman? notes that in many experiments 
the frequencies all lie in a range of optical transparency, where dispersive 
effects are small. If dispersive effects are ignored, because the absorption bands 
in the ultraviolet and infrared are sufficiently far removed from the optical 
frequencies involved, the susceptibility tensor becomes, of course, indepen- 
dent of the frequencies and should become symmetric for the interchange of 
any two indices. This leads to the vanishing of additional elements of the 
tensor in certain cases. 

In the point group 32, for example, general symmetry requires that 


Xxy2(— 20, @, ©) = Xyxz(—2@,@,@), and Xexy( — 20, w, w) = 0, 
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if z denotes the trigonal axis. If the Kleinman symmetry is invoked the first 
two elements should also vanish. Experimentally“ an upper limit for these 
elements of second harmonic generation has been set in quartz, which is less 
than 2% of the element y,..,. Other examples “ of the Kleinman symmetry 
relation are the equality, within the experimental error, of the elements y,,, 
(— 2a, w, w) and x,.)(—2e, w, w) in KDP, which belongs to the tetragonal 
class 42m, and of ¥,,,(—2@, @, w) and y,,,(—2@, @,@) in BaTiO; which 
belongs to 4mm. These are illustrations of the fact that the dispersion of the 
optical nonlinearity is indeed small for these crystals in the visible and near 
ultra-violet. 

For the same reason the antisymmetric part describing sum frequency 
generation, which vanishes for second harmonic generation, may be expected 
to be small, because it exists only to the extent that dispersion is not negligible. 
In general, such elements may be expected to be only a few percent of the 
other elements, except in regions of strong dispersion. In these regions dis- 
sipative processes are usually also important and the nonlinear susceptibili- 
ties become complex. 

An interesting application of symmetry considerations results if circularly 
polarized light is propagating along an axis of symmetry of the crystal. 
Since the combined system of crystal and electromagnetic field has this sym- 
metry axis, the resultant harmonic polarization must also be circularly polar- 
ized or vanish. It has been shown that the second harmonic wave generated 
by a circularly polarized fundamental wave propagating along a threefold 
axis is circularly polarized in the opposite sense. If this fundamental beam 
propagates along a fourfold axis, the harmonic intensity vanishes. The inter- 
esting question of conservation of angular momentum in these situations has 
been discussed in terms of the effect of the trigonal crystalline field potential 
on the electronic wave functions.¢ 


4, NONLINEAR COUPLING BETWEEN ELECTROMAGNETIC WAVES 


The nonlinear polarization must now be substituted back into Maxwell’s 
equations. For a wave at the sum frequency w,; = @,+, one obtains for 
example, 


VAVAE; — (37/c”) &(@3) . E = 4n(57/c?) 7" —a, —@, @4, 2): E, Ep. 

. (18) 
This equation describes the generation of the sum frequency, if the fields E, 
and E, can be considered as constant, given paramenters. The equation has 
been solved for a variety of boundary conditions and geometrical configura- 
tions. 


364 N. BLOEMBERGEN 


If the amplitude of the field E,; becomes appreciable, the nonlinearities pro- 
duced by this field must in turn be taken into account. This leads to a coupling 
between electromagnetic waves, which would have propagated independently 
a in linear medium. The response of the medium is hidden in a coupling co- 
efficient, which is proportional to an element of the nonlinear susceptibility. 
For many experimental situations this is precisely what is desired, because 
only the electromagnetic fields are detected by the observer. 

The lowest order nonlinear susceptibility, 


x30 = A,*.x(—@ 3, M2, @,) : a, a), 


leads to a coupling between three electromagnetic waves, with polarization 
directions 43, @,, and @, and frequencies @3 = w,+W,, W, and @, respective- 
ly. The momentum vector mismatch is Ak2 = k, +k,—k, occurring in the 2- 
direction, normal to the boundary of the nonlinear medium. The three coupled 
wave equations reduce to three coupled equations for the slowly varying com- 
plex amplitudes, 


eA, | 1 A; 


armas ar ran Qni xXk, (@,/n,c)A,A,* exp (—iAkz) (19a) 


oz Vg, 


04, , 1 0A, 


ae 2ni xZE, (@2/n2c)A3A,* exp (—iAkz) (19b) 


Oz Vy, 


04, | 1 O43 


ay 2ni xX, (w3/n3c) A,A, exp (+iAkz) (19c) 


Oz V5, 


where v,,, v,,, and v,, are the group velocities of the three envelopes. Use has 
been made of the permutation symmetry relation. 


These equations have the following integrals 


2 
n,|A4I? + n3|As| 
1 3 


= constant 


n,|A,|? 4 n3|A4|? 


@2 3 


= constant’. 


They express the fact that if the photon flux density in the wave at the sum 
frequency is increased by a certain amount, the photon flux densities in the 
waves at w, and w, are decreased by the same amount. These equations are 
known as the Manley—-Rowe relations in parametric amplifier theory. The 
remaining relations of the three coupled complex amplitude equations can be 
reduced to two real equations for the amplitude |A,| and the phase difference 
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o3—,—2. Exact solutions in terms of elliptic functions have been given. 
These solutions are very similar in form to the solutions more recently obtained 
for the pulse propagation, discussed in Section 2. The set of equations (19a—c) 
displays of course striking similarity with the set of equations (7-11). We refer 
to appendix 1 of Bloembergen‘® for further details. 

The value of the wave vector mismatch Ak is very important. Only for 
Ak = 0 will large secular variations of the amplitudes and phases occur. 
Otherwise oscillations of small amplitude are the rule. In that case the para- 
metric approximation suffices, where one or more fields are considered as 
constant parameters. Since momentum or wave vector matching occurs usually 
only for a few well defined sets of waves in a dispersive medium, it is possible 
to focus our attention on a small number of electromagnetic modes. 


5. POLARIZATION CUBIC IN THE ELECTRIC FIELD AMPLITUDES 


In media with inversion symmetry this is always the dominant nonlinearity.“ 
It is described by a fourth rank nonlinear susceptibility tensor whose elements 
may be calculated by reiterating the perturbation procedure outlined in Section 
3. The ith component of the polarization at a combination frequency w, = 
—@,+@,+@3 is given by 


P(@4) = Xie — W4,—@,, M2, 3) E;*(@,) E,(@2) E;(@3). 
This nonlinearity will, in general, provide a coupling between four electro- 
magnetic waves. If the polarization vectors are 4,, 22, 4, and 4, respectively 
and we write 
Xiz34 = 4,* ay: x(—@1, Wy, @3,—@4) : 2344* 
ue = 4;* a, : 4(-—@;, @;, + @j,—@;) : a,a;* 


and the wave vector mismatch Ak2 = k, +k, —k, —k, is introduced, the four 
coupled amplitude equations take the form 


0A, 1 0A, 
+ Ss, 
Oz Vy, Ot 


= +2ni(@,/n,c) [xt%34 Az As Ag* exp (iAkz) 


1 


4 
+ eat A, A,A;*] (20a) 
- 
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0A,* de i 0A,* 
Oz Dv 


—2n i(w,)n, c) [X8E,, A,* Az Ag* exp (iAkz) 


4 
+ Y way A,* A; A;*] (20b) 
j=l 


A, 1 0A, 


Oe ; +21 i(@3/n3 c) [X1334 A, A2* Ag exp (—iAkz) 


4 
I= 


0A,* 1 0A,* 


oz Vg, Ct = —2n i(w,4/ng c) [XT 34 A, A,* A;* exp (- iAkz) 


4 
+> yh Ay* A; Aj*]. (20d) 
j=i 


Depending on whether the fourth rank susceptibility tensor element is real or 
imaginary, and on whether some of the frequencies are identical or different, 
a large variety of physical phenomena can be accounted for. Here the more 
important cases are listed, while Minck et al.“* and Bloembergen” give a 
more detailed review. 


(1) Third harmonic generation; —w, = @, = o;(=0): 
P®“(3@) = y(—3@, w, w, w) A? exp (3ik . r—3iet). 
This polarization will radiate at the frequency 3. 
(2) Two photon absorption: a; = @, = @; 
PX" (@) = ix''y,(—@, —@, @, + @) A|A|? exp (ik . r—iat). 


The nonlinear susceptibility is positive pure imaginary, and has resonance at 
2@ = @,,, Where a and b represent two energy levels with the same parity. 
The absorption becomes strong at high intensities. 


(3) Saturable absorption;+@, = +@, = @;: The same expression as under 
(2) applies, but now yy; is negative and has a resonance for wm = w,,. The 
absorption becomes weak at high intensity. The effect is used in bleachable 
filters. 


(4) Intensity dependent index of refraction; +@, = @, = @; = @4: 
PX¥(@) = x'(—@,—@, @, +) A|A|* exp (ik . r—~iot). 
(5) Stimulated Raman effect; —w, = +@, = w,and@, = —@3; = @,: 


PX" (@,) = izg''(—@,, @,, OF, —@,) A,|Az|* exp (ik, . r—i@,t). 


REVIEW OF NONLINEAR OPTICAL PHENOMENA IN CONDENSED MATTER 367 
In this case the nonlinear susceptibility is pure imaginary and negative, while 


the difference between the laser and stokes frequency is at a resonance of the 
material system, w,—@, = W,p. 


(6) Antistokes generation; @, = @3 = @,, @4 = Wp +Wgy = Wass 
O, = O,—-O,,: 
PM (eqs) = X°"( — Was, Op; OL, — 5) Ay” A,* 
x exp {i(2k, —k,) .r—i(2@,—@,)t} 


The antistokes radiation will only be intense in the immediate vicinity of the 
momentum matching direction, where k,, = 2k, —k,. 


(7) Stimulated Rayleigh scattering; +m, = —@, = @,’ and 
—3 = @2, =Q,: 
Formally this process is the same as the stimulated Raman scattering from a 
rotational] level. The stokes frequency w,’ is now a frequency slightly smaller 
than the laser frequency, so that w,—m,’ = t,', where 1, is the relaxation 
time for molecular reorientation. 


(8) Modulation of the index of refraction: 
PX (34+ @,—@z') = X(—@3—@, + Oz’, 3, Oy, — Oz’) Ap A, *A3 
x exp {i(k, —k,'+k3) .r—i(@3;+@,—@,')t}. 
In this case all four frequencies are different, but w,; and @,’ are close to- 


gether, so that w,—@ ,' < t,~'. Side bands are created around the frequency 
«3. The susceptibility is in general complex. 


(9) Self-focussing of light: 


In this case the frequencies are all equal as for the intensity dependent index of 
refraction. The two wave vectors k, and k, make a small angle with the wave 
vector k, = k, = k;. Since the wave vectors have equal length, there is a 
slight mismatch Ak = 2k, —k;—k,. 


The macroscopic symmetry properties of these effects may be discussed along 
the same lines as indicated in Section 3 for the third rank susceptibility tensor. 
Only a few illustrations will be mentioned here. 

Third harmonic generation in an isotropic fluid is described by one scalar 
element according to the vector equation 


P(3@) = C(E.E)E, 


where C = Xxxxx% = Xyyyy = Xzezz = 3Xxxyy etc. The polarization for the third 
harmonic is parallel to the linearly polarized fundamental field. Circular 
polarization yields no third harmonic. In a cubic crystal two independent 
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constants (7... 4nd Y,..yy) are necessary to describe the tensor.For propagation 
along a cubic axis circularly polarized light will give a third harmonic polar- 
ized in the opposite sense. 

The second harmonic generation induced in an isotropic fluid by the appli- 
cation of a dc electric field is described by two independent constants 


P**(2@) = C, (E(@) . E()) E,, + C2 (E(@) . Ey.) E(@). 


If the dispersion is negligible and the Kleinman symmetry relation is valid, 
then C, = C, and only one independent constant remains. The three constants 
introduced by Mayer,‘'*’ who measured this effect as a function of the relative 
directions of polarization of the incident beam and the dc electric field, should 
have a well defined ratio, consistent with the relation, in our notation, 


Axxxx(— 20, 0, @, ©): Axxyy(— 2, 0, ©, M): Xxyxy(— 20, 0, @, @) = 3::1:2. 


This has been confirmed experimentally. 

The experimentally observed magnitude of nonlinear susceptibilities may 
be compared with the theoretical calculation outlined in Section 2. As eqn 
(14) shows, the matrix elements between all excited states must be known. If 
the denominators are all taken to be equal, (this is a good approximation if 
the light frequencies are small compared to any average excitation frequency 
2), a closure rule may be applied and the susceptibilities may be expressed in 
terms of moments of ground state wave functions alone. In this approxi- 
mation there is no dispersion, and one finds for the susceptibilities of bound 
electrons in the ground state |g>, the following expressions, where 


X; = x: —<glxil9>, 


2Ne*7. | 
xp 7 hO <9|X;X 19> 
ee 3Ne® 
Xi = Hoe 69% 5; £,19> 


wi 4Ne* Bode on Soon, 
Xin = OQ? <glXi Xj X, XQ) —2<gl|%; Xjlg><g1X X,|g>- 


We assume N equivalent valence electronic orbitals per unit volume. For a 
given crystal structure one has an additional geometrical factor to take account 
of the orientations of the orbitals in the unit cell. The observed linear suscepti- 
bility may be used to determine the effective mean excitation frequency Q. The 
nonlinear susceptibilities have thus been calculated for the simple crystal 
structures of the IIJ-V and group IV semiconductors, on the basis of various 
models for the ground state wave function. 
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Table I compares the nonlinear susceptibility describing second harmonic 
generation of the 10-6 yz CO, laser line with theory, and Table IT compares the 
mixing of the 10-6 4 and 9-6 p light of two CO, laser lines to yield a new com- 
bination frequency 2w,—q@, of wavelength 11-6 , in these materials. The 
agreement is satisfactory if a suitable model for the ground state wave func- 
tion is chosen. The theory of the nonlinear susceptibilities is plagued by the 
same difficulties as the theory for the linear susceptibilities. Since the lowest 
order susceptibility depends on the lack of parity of the wave function, it is 
quite sensitive to the amount of asymmetry ,which is rather well described by 
Phillips model of electronegativity. 


TABLE I 


Comparison between experimental and theoretical values of 27, 4(—2@, @, @) in 
units of 107 °e.s.u. The harmonic generation is due to valence band electrons, while 
@ corresponds to the 10-6 4 CO, laser line. 


Crystal InAs GaAs GaP GaSb 
Experiment +1-0 + 0-45 +0-26 +151 
Theory? — 1:5 ~— 0-43 —0-21 —78 
Theory +0-96 +0-57 +0:34 +1-2 
Theory? +2:-05 + 0-95 +0-60 +0°80 
Theory” +11 +0°85 +0-2 +1:8 


* Wynne, J. J. Ph.D. Thesis, Harvard University (1969); Bloembergen, N. and 
Wynne, J. J. Phys. Rev. 188, 1211 (1969). 

> Sha, S. S. and Bloembergen, N. Phys. Rev. 171, 891 (1968). 

© Levine, B. F. Phys. Rev. Lett. 22, 787 (1969). 

4 Flytzanis, C. and Ducuing, J. Phys. Rev. 178, 1218 (1969). 

° Flytzanis, C. C.r. hebd. Seanc. Acad. Sci., Paris. B267, 555 (1968). 


TABLE IT 


Comparison between experimental and theoretical values of the nonlinear mixing 
susceptibility 4y74.(—2@,+@2,@ 1,1, ~@ 2) in units of 107 *°e.s.u. The mixing 
is due to valence band electrons, m, and @, correspond to the 10-6 yw and 9-6 pp CO, 
laser lines respectively. 


Crystal Ge Si GaAs 
Experiment? +1:5 0-08 +0-10 
Theory? —0-88 —0-063 —0-13 
Theory* — 16 —0-53 as 


* Wynne, J. J. Phys. Rev. 178, 1295 (1969). 

> Jha, S. S. and Bloembergen, N. Phys. Rev. 171, 891 (1968). 

© Levine, B. F. Phys. Rev. Lett. 22, 787 (1969). (Since references ° and © use a dif- 
ferent definition of the complex amplitudes, the value 4 y has been tabulated, 
where x follows the definition used in this paper). 
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6. STIMULATED RAMAN Errecr“!4) 


In the stimulated Raman effect two waves are incident on the material system, 
one at the laser frequency @,; and one at the stokes frequency @,, so that 
@,—, corresponds to a resonant frequency @,, = w, of the material. This is 
often a molecular vibrational frequency, but it may correspond to an electro- 
nic transition. Quanta are scattered from the laser beam to the stokes beam, 
while the material makes a transition from state |a> to |b>. This process is 
described by a Raman susceptibility, which is negative imaginary at the reson- 
ance condition @,—@, = w,. The amplitude of the stokes wave obeys eqn 
(20a), which in this case for a process stationary in time reduces to 


dA, af 
dz aml +27(@,/n, c) (-x Raman) |A,|7A,. (21) 


The amplitude grows exponentially, as the nonlinear Raman polarization is 
always leading the electric field A, by 90° and continues to do work on the 
wave. At the same time the laser beam would be attenuated because we have 
the relationship 


ARaman = +Xjjii( — Og, Os, — Wp, OL) = X* iijj(— Oz, OL, — Wy, W,). 
Thus eqn (20b) becomes 


dA, 


Ge eeu €) (— 2" raman) |A,|7AL. (22) 


This equation describes the depletion of the laser beam. Initially the laser 
intensity |A,|? may be taken as a constant parameter. The stokes intensity 
then grows exponentially. 


TF , = F 9 exp (952) (23) 


with a Raman gain coefficient given by 
Gs = 4n(@,/n, C) [X" Ramanl |Arl?. (24) 


This gain constant is also directly related to the cross section for spontaneous 
Raman scattering, because the same material matrix elements occur in the 
calculation of the nonlinear susceptibility describing the stimulated counter- 
part of the spontaneous emission. 

When an antistokes frequency w,, = w,+, and a laser frequency w, are 
considered together, the exponential loss will occur at w,,, while @, will 
exhibit exponential gain. When the population of the two vibrational levels 
is inverted, the nonlinear susceptibility changes sign and gains and losses 
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would be interchanged. With a non-inverted population it is nevertheless 
possible to have exponential gain at the antistokes frequency, but only in 
particular directions where there exists a close coupling between stokes and 
antistokes waves, i.e. for small Ak2 = k,+k,,~2k,. Due to the normal colour 


k.(w-w,,) 
a4 les kgsf Wi+wy) Ak 
ee a ares OEY 
Ke 2k, 
(a) 
' 
ks(w) kas(w") i 
t 
ky (w,) 23k, k(w) 2(k, +8k,) 


(b) 


Fig. ta. The wave vector relationship in stimulated Raman scattering of stokes and anti- 
stokes radiation. 


Fig. 1b. The wave vector relationship of scattering of light without significant change of 
frequency due to the intensity dependent index of refraction. The laser pump wave vector 
is corrected by dkz = 4k107 opt 


dispersion in most liquids the antistokes gain occurs in the cone with an apex 
angle of a few degrees, as shown in Fig. la. If the laser field is considered as 
a constant parameter, eqns (20a and d) with A, = A, and A, = A,, become 
in the steady state regime 


dA 

aS =F 2ni(@,/n, c) ae |A;|?4, as 2ni(@,/N, c) XsaLL At A, exp (iAkz) (25a) 
dA,,* ; #2 : : 
ae = —271(@,/Ma5 ©) Nasu Ay*’ As EXP (—iAkz) ~27i(@,,/N45 C) 


x YNE|A,|? Aqy*. (5b) 


This is a set of two linear coupled equations in A, and A,,* and can be solved 
exactly, by anticipating exponential solutions of the form A, exp (ik,z+iAKz) 
and A,,* exp (—ik,,z+iAKz). Substitution into eqns (25) leads to a quad- 
ratic equation in AK and the intensity gain coefficient is 2 Im AK or, 


9s = + 2 Imf4(Ak)? +22(@,/n, 0) "|, |2AK]!2. (26) 
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Here we have made the (unnecessary) approximation that the dispersion in 
the nonlinear coefficients can be ignored. For large momentum mismatch, 
[Ak| > [22(@,/n, c)|7%"|]*/7 |Az|, the square root may be expanded and g, 
reduces to eqn (24). The stokes and antistokes waves are uncoupled and have 
the single wave gain or loss. Near Ak = 0 the normal modes are mixtures of 
stokes and antistokes. A wave with 70% stokes character and 30% antistokes 
character will have an exponential gain, while the mode with 70% antistokes 
character in the same direction will decay exponentially. One thus observes an 
exponential gain for antistokes light because it is dragged along by the dom- 
inant stokes character of the mode. Note that for exact momentum matching, 
Ak = 0, there is neither gain nor loss. The normal modes have exactly half 
stokes and antistokes character. 

Many further examples of this coupled mode behaviour will now be given. 
In fact, the pure stokes generation may be considered as the parametric down 
conversion of a laser wave into the stokes wave and a vibrational wave. In 
this description the response of the material represented by the off diagonal 
element of the density matrix p,, is explicitly displayed, whereas in eqn (21) 
it is hidden in the Raman susceptibility. The dispersive behaviour of the op- 
tical phonon wave is described by 


22: 2 2 
oO, = 0,0 +Bk, > 


where ,,9 is the vibrational frequency of an isolated molecule. The optical 
phonon branch is usually very narrow and the dispersion constant £ is cor- 
respondingly small, so that the stokes shift is essentially the same in all direc- 
tions. A damping time of the optical phonon wave t, is introduced pheno- 
menologically. It represents the effect of collisions with acoustical phonons or 
other degrees of freedom. The width of the spontaneously emitted Raman 
line is proportional to t,~*. 

The coupling of the light waves and the vibrations may be described in 
terms of a classical model, in which the polarizability of the molecule is a 
linear function of the vibrational coordinate (internuclear distance R), 


6= Ao “ (6a/6Q,) 0., 
where 
Q, = R (2p)'? exp (—ik,.¥) (27) 


is the optical phonon coordinate, and p is the reduced vibrational mass 
density. From the coupling term in the time-averaged potential energy, 


Uin = —(0a/00,) QO, E,*E,+complex conjugate, (28) 
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the following set of coupled equations may be derived for Q,* and E,, 

Ou* — Blk, —k,|7Q,* + 97 O,* + (2i/t,) Q,* = N(du/00,) Ex*E, (29) 

E,+0,’E,+ (2/it,) E, = 4xm,? N(da/00,)* E, Q,*, 


where the dots denote time differentiation. Phenomenological damping terms 
have been introduced for the vibrational and the stokes wave. Since the coupl- 
ing term is small, the set of coupled wave equations may be reduced to a set of 
coupled amplitude equations 


A,* ar (A,*/ T,) = (o,/ 2i) N (Ga/ 0Q) A,*A, 
A, + (A,/T;) = Gas 2ni@,) N(0a/0Q)*A,A,* (30) 


In these equations the amplitudes are represented as growing in time. The 
corresponding problem of spatially growing waves is obtained by replacing 
0/Ot bY V,oup (0/0z). The solution of this problem of two coupled waves is 
already familiar. There is one coupled mode with exponential gain and another 
mode with exponential loss. Since the optical phonon is heavily damped, the 
mode with exponential gain has predominantly stokes light character. To the 
extent that A,* or Usroup OA,*/0z may be ignored compared to the damping 
term A,* t,~', the steady state value 


A,* = (t,@,/2i) N(60/0Q) A,*A, (31) 


may be substituted directly into the equation for A,. A stokes wave with an 
exponentially growing amplitude results, if 


7(,0,) N*(a/8Q)* |Az]” t, > t."*. (32) 


The term on the left is the Raman gain, the term on the right represents the 
amplitude decay constant of the stokes mode in the stokes resonator, caused 
by reflectivity and scattering losses. The gain in time or in space may be 
equated to the expression for the Raman gain g, in terms of the Raman 
susceptibility or the spontaneous Raman cross section. One thus finds for the 
Raman susceptibility, in terms of this classical model for the Raman effect 


x" Raman — —n? N?(6a/0Q)?t,/20,. (33) 
Antistokes generation can be discussed in terms of this model by consider- 
ing the following four waves, the laser wave, the optical phonon wave, the 


stokes wave and the antistokes wave, satisfying the following energy and 
momentum conservation rules 


Dy, = 0, — Ws = Was Wrz, 


k, = k,—k, = k,,;—k,. 
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The laser wave is eliminated in the parametric approximation by considering 
the laser field as a fixed parameter. The optical phonon wave may be elimina- 
ted from the coupled equations, because its group velocity is near zero and it is 
heavily damped. In this manner one obtains two equations for the coupling 
between stokes and antistokes which leads again to a gain constant, identical 
to eqn (26). 

The two formulations are therefore equivalent, as far as the stimulated 
Raman effect is concerned. It should be remembered that the Raman effect is 
not always associated with a vibrational excitation. It could be an electronic 
excitation in which case the optical phonon should be replaced by an exciton 
wave. Or it could be rotational excitation, in which case a “rotational wave” 
would be involved, etc. The stimulated Brillouin and Rayleigh scattering may 
be considered in a similar manner. The interaction between the light waves 
and any other coherent excitation or boson wave can thus be described, and 
seemingly very different physical phenomena can be discussed from a com- 
mon point of view. 


7. INELASTIC RAYLEIGH SCATTERING FROM ANISOTROPIC MoLEcuLEs‘!*? 


In fluids with optically anisotropic molecules the dominant nonlinear effect is 
often caused by the reorientation of the molecules subjected to the strong 
electric fields of one or more laser beams. Consider an axially symmetric 
molecule with polarizabilities «, and «,, subjected to two parallel fields 
E,(@,) and E,’(w,'), which make an angle @ with the axis of symmetry. 

The orientation dependent part of the dielectric energy may be written as 


U = 4(a%—a,) (1—3 cos?@) [A,A;'* exp (—iAw?) 
+ A,*A;,' exp (+iAat) + |A;|?+]Az'|71. (34) 


Here Aw = w;—a,’ and the rapidly varying term at w,+@,’ has been 
ignored. There is a corresponding torque t’ = — 0U/0@ acting on the molecule. 
In addition there is the viscous torque responsible for a rotational Brownian 
motion. The Fokker—Planck equation for this motion was given by Debye 


af ao Oh af 


ee ees eee ee 
Pend o0 |= a0 | we 


For a sphere of radius a in a fluid with viscosity 4 the damping constant 
¢ = 8xna>. To lowest order in U/kT the steady state solution for the rotation 
distribution function is, if we introduce the orientation relaxation time 
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t = C/6kT, 
A,A,’* 
—iAwmt 


] 
[ee 


’ —1)||A;|?+|A 
Bot TEER i 20) @ 008? 0—1)| LAL)? + 14," AEE 


A, * 
x exp (—iAwt) + ae 


From this nonisotropic distribution, a polarization cubic in the electric field 
strength may be calculated, 


exp (+ idor)}. (36) 


PNE = INola,— —a,) {7 (3 cos? 9@—1) f2zx sin @ dO[A, exp (—iw,t) 


+ A,* exp (+iw@,t) + A,’ exp (—iw,'t) + Ay’* exp(+i@,’t)]. (37) 


This polarization describes not only the intensity dependent index of refraction, 
but also the modulation of this index at the beat frequency. This provides a 
coupling between waves spaced in frequency by Aw. Due to the phase shifts 
when Awrt is comparable or larger than unity, there will be a polarization at 
w,', proportional to the intensity |A,|?, and leading the electric field A,’ in 
phase. This will provide exponential growth at w,' of this wave, the maximum 
gain occurring for w,;’ = w,—t +. This is the stimulated inelastic Rayleigh 
scattering. 

It is instructive to consider also the coupling with the antistokes Rayleigh 
component and to allow the two waves to make a small angle @ with the direc- 
tion of the strong laser beam E,. We have again the set of coupled equations 
(25a) and (25b), but the nonlinear susceptibility constants are, for the orienta- 
tional Rayleigh case, 


AN ay ay 1 
= * — eee I = —____- 
Xs = Xan 45kT +a) (384) 
_ANo(ay—0)? 1 ee 


Ksabt = X"asut = 45kT  1+iA@t 


The wave vector mismatch Ak may be determined from the diagram in Fig. 
Ib. It should be kept in mind that the wave vector of the laser beam itself is 
changed, by the intensity dependent index, from the original value k, = 
wc” *€o'/? to the value k,(1 +467,,.)- 

Here a short hand notation is introduced, the meaning of which will shortly 
become apparent, 


22 Nol —«,)? |A,|? 162 


Oo = 
one 45kTé 
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The momentum mismatch may then be expressed in the form, for small 
angles 0, 

Ak = k, (67 +67,,,). 
Solving the two by two determinantal problem for the two coupled amplitude 
equations, one finds for the intensity gain constant 


60k 467, t 267, + 
0-3 [+ pata oO] treat] oD 
This gain is a maximum for @ = Oop: and its value is then 
PA 
fon k,8 opt (40) 


cial 1+ Aw?r? 


The behaviour is plotted in Fig. 2. For @ > Op: the gain is independent of 6. 


10 


Gain in units of gmex 


Angle of emission @ 
Fig. 2. The dependence of the gain constant on angle and small frequency shift Q = 
@L_—OzL' = wo''—crz for the near degenerate scattering process shown in figure 1b (after 
Chiao, Kelley and Garmire, Phys. Rev. Lett. 17, 1158, (1966)). 


There we have the uncoupled behaviour and the stokes gain is maximum for 
Awt = 1 and vanishes for Aw = 0. This stimulated inelastic Rayleigh scat- 
tering has been observed in the backward direction. 

For small angles 6 there is gain at Aw = 0 for 0<0<./2 6,,,. In this case 
two waves grow at the same frequency, but slightly off-axis from the original 
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laser beam. The growth of these off-axis components signifies the onset of 
self-focussing. Since the index of refraction is highest on the centre axis of the 
laser beam, where the intensity is highest, light rays will be bent towards the 
axis, since in the geometrical optics approximation the curvature of the ray 
is proportional to the transverse index gradient. The laser beam orientates the 
molecules in the fluid in such a manner as to create a converging lens. Since 
so many pairs of off-axis waves are involved, it is better to take the transverse 
variation of the field amplitudes and the concomitant diffraction effects into 
account by the following wave equation 


067A ie 0°A + Dik 0A 
Ox* ~~ dy? Me Be 


= —47k,? 7%" |A|? A. (41) 
We shall not discuss the solution of this equation, but it does lead to self 
focussing and possibly the creation of bright filaments ranging in diameter 
from 5-50 microns. Jn these filaments the intensity is much higher than the 
nominal flux density in the incident laser beam. This self-focussing phenome- 
non explains many anomalies in the observation of other nonlinear effects, 
especially the anomalously high Raman gain. The observed frequency broad- 
ening of stokes and antistokes components is caused by the modulation of the 
index of refraction. It is described by eqns (36 and 37), which give polarization 
components at new frequencies 2w, — w,’, etc. Both effects are especially pro- 
nounced in fluids with optically anisotropic molecules. 


8. STIMULATED BRILLOUIN EFFECT 


In this case the role of the optical phonon is taken over by an acoustical 
phonon. The main distinction between the stimulated Raman effect and the 
stimulated Brillouin effect lies in the different dispersion characteristics of the 
acoustical and optical phonon branches. The Brillouin frequency shift de- 
pends on the angle of scattering 9. This dependence can be derived from the 
resonant conditions of energy and momentum conservation, viz; 


ke od k, —k, and Dac = aclKacly~ * = O,— Qs. 


To a very good approximation the Brillouin frequency shift in isotropic 
media is given by, 
Oy = 2e,(nv,,/cy*) sin (6/2), (42) 


where y = C,/C, the ratio of the specific heats at constant pressure and density 
respectively, and v,, is the low frequency limit of the acoustic velocity. 

Again we have a coupled wave problem between one or more light waves 
and the acoustic wave. We write down the equations for an isotropic fluid, 


378 N. BLOEMBERGEN 


where the density fluctuations 
P = Pot Pp; xp (ik,. TP —1Dg¢ t) 


are coupled to a Brillouin stokes component travelling in the backward direc- 
tion. In the near forward direction the frequency shift becomes very small and 
there would also be simultaneous coupling to an antistokes component. 

The acoustic wave equation has the form 


2 op,* 1 de 
2a € Sel. 
ot? y ae Nie Tao Ot ~ an 6p 


2 * 2 
é Pi _ Vac 


— V*(E,E,*), (43) 


where T,, is the damping time of the acoustic wave. The right-hand side repre- 
sents the electrostrictive effect which drives the acoustic wave at the difference 
frequency between the Brillouin light wave and the laser wave. 

The Maxwell equation for the Brillouin stokes component is 


1 de 0? 


vz, - 45 [oe = Fp art Ee M1") (44) 


ce Fi 
The right-hand side represents the change in polarization due to the photo- 
elastic effect. For a medium obeying the Lorentz—Lorenz relation, de/6p = 
(e—1)(€+ 2)/3 po. 

The two coupled wave equations may now be treated in a manner similar 
to the treatment of the stimulated Raman effect. They can be reduced to two 
coupled amplitude equations. For a quasi steady-state situation, where only 
spatial growth has to be considered, the problem is again reduced to a two by 
two determinantal problem. Since the acoustic propagation velocity is small 
and the damping of the acoustic wave is large, we may again first solve for p, 
and substitute the steady-state value 


te. . 


ik, — k,|” A, A,* (45) 


a 2D An = 


into the Brillouin wave equation. This leads to an exponential gain constant 
for the Brillouin wave. If we introduce the Brillouin shift 


Ds” = (Ug¢7/) {k,—k,|’, 


the gain constant for the Brillouin stokes component may then be written in 
the form 


o, Gs acl, —k,|? 
Pre po (<— =). 1Ap|? cee 01 42,2 (@, 00,05)! (46) 
nN, C op 2(@;,—@,) 


Ss 


REVIEW OF NONLINEAR OPTICAL PHENOMENA IN CONDENSED MATTER 379 


This exponential gain has been demonstrated experimentally in a manner 
similar to that used to measure the gain constant in the stimulated Raman 
effect. The arrangement of a Brillouin backward stokes oscillator and a 
backward amplifier cell is shown schematically in Fig. 3a. By varying the 
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Fig. 3. (a) Diagram of the experimental arrangement to measure the backward Brillouin 
scattering gain. (b) The experimental data for a nonabsorbing liquid (solid squares) and the 
data with the addition of an absorbing material —« = 0-83 cm~1* (open squares). (c) Cor- 
rection due to the thermally induced effect. g7, is the difference between the experimental 
gain measured with and without absorption. The solid line is given by eqn (50). (After Pohl, 
Reinhold and Kaiser, Phys. Rev. Lett. 20, 1141, (1968).) 


composition of a liquid mixture in the oscillator cell, its frequency can be 
tuned through the gain profile of the liquid in the amplifier. The resulting 
curve is shown in Fig. 3b. When the liquid is absorbing, a very different result 
is obtained as shown by the open points. In this case the temperature changes 
and a new coupling to the dielectric constant must also be taken into account. 
It should be emphasized that these results are only valid in the quasi steady- 
state regime. For short laser pulses of duration comparable to or shorter than 
the acoustic damping time, transient effects must be taken into account, as 
discussed in the final section of this chapter. 


9. THERMALLY INDUCED BRILLOUIN AND RAYLEIGH SCATTERING 


In absorbing media the temperature variation and the concomitant changes in 
index of refraction must also be taken into account. Even in nonabsorbing 
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media there is an electrocaloric effect described by the intrinsic temperature 
dependence of the dielectric constant, (de/0T),. This term is, however, rather 
small compared with the effect due to (de/0p), in combination with thermal 
expansion. The combined behaviour of the stokes and antistokes light waves, 
the density wave and the temperature wave must be considered. The tempera- 
ture wave normally has a strictly diffusive behaviour but, in the case of second 
sound, a second time derivative of the temperature should be added to the 
equation. Unless one looks in the near forward direction, the two light waves 
are essentially uncoupled due to the momentum mismatch. We then have the 
following set of three coupled equations, 


2 8 OE, (=) 1 &(E,6,*) (=) 1 @(E, p1*) 
Vee 2 ot Net) oo? Ot? Bp lcet ae 
00,* (y—1)ép,* acn 1 (=) 
—KV76,* a = BME 7 
Nop aR oe phen et An NOE NS 
x —(E,*E), — (47b) 
i) eas 
Ops* en  , 2 OPr1*  BPoac’ 1 de 
aigee i a ia, Ce a 
ae ny ay ae role, 


x V(E,*E,).  (47c) 


Here the temperature variations at a frequency close to w,; — @, are denoted by 
0, = T—T); K (= 4/poC,) is the thermal diffusivity, C, is the specific heat at 
constant volume, « is the absorption coefficient for the light, B is the volume 
thermal expansion coefficient. The temperature variations are driven by adia- 
batic expansion, which can often be ignored in dense liquids (yx 1), by light 
absorption and by the electrocaloric effect. We have ignored the absorption 
proportional to «(|E,|*+|E,|”) which will give rise to a linearly increasing 
transient effect. The longitudinal compressive type acoustic wave equation is 
the same as in the preceding section, except for the addition of a term pro- 
portional to the thermal derivative. The light wave equation has an added 
term proportional to (0¢/0T),. 

We could write down the dispersion relation for this coupled wave problem 
in terms of a three by three determinant in k? and w*. For many practical 
situations we again assume a quasi steady-state solution for 0,* and p,* pro- 
portional to E,;*E, and substitute these values into the equation for E,. If the 
terms in (de/OT), are ignored, the resulting gain constant for the intensity 
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may be expressed in the form 


O, 


g.= — —*1A,)? 


n,c 
xl (de/6p)TuBn,c(v,,’k*/yC,) + po(Ge/Op) 77 k*(— Kk? — iw) (48) 
[~—w* + (v2, ,k?/y) + 2iwt,. 1] (—Kk? —iw) + i(1—y7 +) wv,,2k? | : 


where @ = w,—a@, and k = |k,—k,]. For « = 0 and y = 1, which holds for 
nonabsorbing, dense fluids, the expression reduces to the Brillouin gain given 
by eqn (46). 

The physical significance can best be appreciated by considering separately 
the regions where the frequency shift of the scattered light is very small, and 
and where it is nearly equal to the Brillouin shift w, = v,, k. 


(a) Thermal Rayleigh Scattering, o ~ Tp = Kk? 


In this case the denominator is resonant near zero frequency, and the gain 
constant is given approximately by, 


wo aBn,c { de @ 
= — —*{4,/? Li pena, (ge er ee eae 49 
GR ne u C, (+) oop o) 


Note that [', measures the width of the Rayleigh line, and that the gain 
constant is positive for negative w. This means that the stimulated thermal 
Rayleigh scattering exhibits an antistokes shift. Light of a higher frequency 
than that of the incident laser beam is generated. One would get the normal 
stokes shift, if the electrocaloric effect alone were operative. In the absence of 
absorption, only the terms (de/9T), which were ignored in eqn (48) would be 
active. The normal stokes shift also occurs in the Rayleigh wing scattering from 
anisotropic molecules. This is to be expected on thermodynamic grounds. In 
the case of absorption there is another source of energy available. The thermal 
energy makes conversion to higher energy photons possible. Experimentally 
the width is not determined by I’, which is usually very small, but rather by 
the line width of the laser source or by transient phenomena. 


(b) Thermal Brillouin Scattering, w ~ wz = v,,k. 


In this case the last term in the numerator of eqn (48) leads to the normal 
Brillouin gain, given by eqn (46). To this must be added a correction from 
the thermal effect 


o,|A,|7 ( 0& apn, c Tac Op (@— pz) 
nec \ép/r C, 2% 14+%,,7(@—@z,)*" 


(50) 


ITs = 
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This expression is positive for a, —w,—@gz = @—@,>0. There is more gain 
above the exact downward Brillouin frequency shift w,;. This “dispersive” 
behaviour of the gain is shown in Fig. 3c. There is very good agreement 
between theory ‘!” and experiment.2 

This absorptive effect could be superimposed equally well on the stimu- 
lated Raman effect. The thermal expansion coefficient 6 must then be replaced 
by a constant which describes the thermal variation of the optical phonon 
frequency, or of (éa/6Q). 

The derivation given here applies only to a quasi steady-state. Transient 
effects during very short laser pulses will show a considerably different be- 
haviour, since thermal relaxation and thermal expansion processes may not 
have time to develop. 


10. STIMULATED RAMAN SCATTERING FROM POLARITONS 


In ionic piezoelectric crystals the optical phonon modes are directly coupled 
to an electromagnetic mode with the same frequency and wave number. A 
mixed mode results, which is simultaneously infrared and Raman active. This 
coupling of the ionic and electronic oscillators results in a polariton wave 
which has both electromagnetic and vibrational character. When a laser beam 
is incident on such a crystal it is possible to generate the stokes frequency equal 
to the laser frequency minus the polariton frequency. The coupling coefficient 
is an interference of a Raman-type coupling and a parametric coupling be- 
tween three electromagnetic waves. As Shen“® first pointed out, the simul- 
taneous coupling of the wave at the stokes frequency w, = @, —@,, the optical 
phonon at w, which has an uncoupled eigenfrequency wo, and the infrared 
electromagnetic wave at w, must be considered, 


V7E,+&(@2/c)?E, = —4n(@/c)?(Ndg E, O* +Nd, E, E,*), (51a) 
V7E,* +6,,(@,/c)?E,* = —4n(@,/c)?(Nd;* E,* E,+NeQ*), (51b) 
— po,? O* + iol wO* + ww,"O* = eE,*+do* E,* E,. (Sic) 
Here yp is the reduced mass of the ionic oscillator with an effective charge e. 
The quantity dg is equivalent to the variation of the polarization with the 
ionic coordinate, which was called da/0Q in eqns (27-32) of Section 6. We 
have ignored the dispersion term proportional to f in the optical vibration. 


The quantity d, is equivalent to the nonlinear susceptibility used in eqns (19). 
Assume exponential solutions for the three fields of the form 


E, = A, exp (ik, .r+y,2.r) 
E, = A, exp (ik, .r+y,-¥) 
Q = QO, exp (ik, T+, Tr) 
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with k, = k,—k, and y, = y2. Substitution into the three coupled equations 
leads to a secular equation, from which y can be determined. (There can only 
be gain in the direction normal to the crystal boundary.) 


4ndgAr (dg/ e)Ar £2 —(@/ c)*(K, = iy)? 
bo — (c?/o, (kK, + iy)* 1 4nd,A,* =0 
=| p(—@,7+@o7 +iwy)/4nNe? —(dgle)A,* 
(52) 


For the detailed algebraic solutions we refer to the literature.“ *) The following 
approximation is, however, very good in the neighbourhood of the polariton 
resonance. Here the vibration damping T is dominant. One solves the second 
equation for E,*, substitutes into the equation for Q* and solves for Q* in 
terms of E,*E,. Substitution into the first equation then leads to an expon- 
ential gain for the wave E,. The result is 


2 _ 4nNew,? L(@* —@,”) 
2(ky—&2°!? wc” *)—2iy = ee |A;|? (Ace d+ do) 
4nNe*/p : - 
Wenge © 


The last term contains a resonant denominator. Its real part vanishes for 


w,” 4nNe? 
Kes | ronenle 
vee Pe" @e=07) ey 


This is precisely the dispersion curve for the undamped polariton. If there is 
momentum matching between the laser wave, the stokes wave and the un- 
damped polariton, the gain is maximum. The gain constant is proportional to 


Ned 2 
( zs H(@o? —@, ) 


which describes the interference of the direct electronic parametric coupling 
and the Raman type coupling via the ionic vibrations, which is resonant for 
1 = Wo. The gain constant itself does not exhibit resonant behaviour at this 
frequency. As the ionic vibration is excited, there is a concomitant electro- 
magnetic radiation component. Because of the strong absorption, the power 
flux density at this infrared frequency is small compared to the flux density at 
the stokes frequency. The ratio is about g,@,/w.%, where g, is the power gain 
constant at w, and « the absorption coefficient at @,. 

The variation of the coupling parameter when the difference frequency of 
the two visible light waves goes through the ionic infrared resonance has been 
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observed in GaP, when visible and infrared laser beams were mixed. The stim- 
ulated Raman scattering from polaritons has been observed“ in LiNbO3. 
The stokes shift of the Raman scattering in the forward direction is 497 cm~!, 
while in the backward direction it is 631 cm~ +. In this last case the momentum 
of the wave corresponds to an essentially pure TO phonon. In the former case 
the dispersion curve is cut at an appreciably lower frequency of mixed vibra- 


tional character. The dispersive behaviour of two A, symmetry modes in 
LiNbO; is shown in Fig. 4. 
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Fig. 4. The polariton dispersion curves characteristic of A, symmetry modes in LiNbO3. 
The scattering angle 6 is the angle between the incident laser light and the stokes radiation. 


The conditions for energy and momentum matching are indicated for Raman scattering of 
ruby laser light. (After Gelbwachs et al.¢%), 


The complications which arise when more optical phonon branches are 


present, or when account should be taken of the different modes of polariza- 
tion, will not be reviewed here. 
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11. CouPLING oF LIGHT WAVES WITH OTHER EXCITATIONS 


The pattern that has been established in preceding sections can be extended to 
the coupling between all types of classical waves, which correspond to boson 
excitations. These include plasmons, spin waves, helicons, rotons etc. All these 
waves are, of course, not only coupled to the light waves, but are also coupled 
to each other. The spin waves are, for example, strongly coupled to acoustic 
waves through the magnetoelastic coupling. Plasmons in semiconductors can 
also be strongly coupled with infrared phonons. If the coupled wave problem 
is linearized, the complex propagation constants with corresponding gain (or 
loss) may be obtained from a secular determinantal equation. The physical 
nature of the coupling is, of course, different for each pair of waves. 

Here only the coupling of a light wave with a plasma wave and with a spin 
wave will be written down as further examples. 


a. Plasma Raman scattering 


Consider a small volume element at the point r. Let the average deviation of 
the electrons from their equilibrium position in this volume element be u(t). 
Introduce normal coordinates Q, as the Fourier transform of this average 
deviation or local strain of the electron gas, 


Q, = fae exp (—ik .r)d?r. 


The canonical conjugate to this variable is P,. The Hamiltonian density for 
the plasma waves then takes the form, 


H plasma ~ $ » [1 /Nm)P, ¢ P_.+ak?Q, . Q_.+42N7e?Q, o Q_x]. 
: (55) 
Here N is the average number of electrons per unit volume and « is the bulk 


modulus of the electron gas. The fluctuation in the electron density from the 
average due to the presence of plasma waves is 


dp(r) = N divu = iN Yk. Q, exp (ik. 1). 
k 


The change in the interaction of the two light waves with the electrons in a 
unit volume due to the presence of the plasma waves is consequently 
H ing = (€? /2me*)A*5 p(x), 
where 
A = A, exp (iq, .r—iw,t)+ A, exp (iq, .r—im, t)+c.c. 
When all nonresonant perturbations are truncated, the interaction Hamilton- 
ian density between the two linear parallel polarized light waves and the longi- 
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tudinal plasma waves (Q||k) becomes, 


HK in, = (iNe?/mc?) ¥ kAy A,*Q,* exp [iq,—4,—k).r]+c.c. (56) 
k 


The equations of motion for the plasma coordinate are 

Py = — OF stasmat A int) OQk 

On = +O ptasmat H int)/OP 
These equations of motion can be combined into a wave equation for Q,. 
Because of the presence of #;,,, a driving term proportional to the light ampli- 
tudes A,A,* is added to the plasma wave equation. Landau damping and 


damping by collisions may be taken into account by a phenomenological 
damping term. The wave equation for Q, is 


O,+aV?0,+0,Q, = (ie?k/m?c?)A, Ag* + (2i,/t’) Q,. (57a) 


The wave equations for the light amplitudes A, and A, are also augmented 
by a nonlinear term, because the interaction Hamiltonian gives rise to a non- 
linear current density, 


PE (@,) = —CO# ip lOA* = (+iNe?/me) kA, Oy" 


and a similar expression for j¥“(w,). The wave equations for the two light 
waves become, consequently, 


~ 4, +02V2A, = (AniNe?/m) kA, Oy, : (57b) 
— A,+¢?V7A, = (4niNe?/m) kA, Q,*. (5Tc) 


The set of three coupled nonlinear wave equations is familiar from the Bril- 
louin and Raman effect in other media. 


b. Spin wave Raman scattering. 


The dominant physical mechanism for the coupling is via the spin-orbit inter- 
action. Suppose the ground state of an ion is an orbital s-state, with m, = 0, 
and a spin m, = +4. Consider an excited p-state with a spin-orbit coupling. 
It admixes the states with m, = 0, m, = 4 with m, = 1, m, = —4. A light 
wave with a circular polarization around the direction of quantization 2 of the 
spin wave will have an electric dipole matrix element 


«Ss, my, = 0, mM, = —4ler* E,|p, my, = 1, mM, = =>): 


Another light wave with a linear component in the 2 direction will have a 
dipole matrix element 


<p,m, = 0, m, = 3|ezE,|s, m, = 0, m, = 4). 
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Since the two intermediate states are linked by the spin-orbit coupling, it is 
thus possible for the two light waves to produce a spin flip in the ground state. 
Very crudely, the value of the matrix element for the spin Raman effect will be 
lowered by the ratio of the spin-orbit coupling parameter to the orbital split- 
ting. The two light waves will drive the spin excitation on each ion site with a 
phase determined by the wave vector difference q = k,-—k,. They are thus 
driving a spin wave of wave vector q and frequency @,—@,. Significant coup- 
ling will only occur, if both energy and momentum matching can be satisfied. 
The equation for the light driven spin wave is 


in Oy til+ pv? | <S,(,0)> = AE, E,*. (58) 

In combination with the wave equation for the stokes wave E,, the parametric 
gain of these waves pumped by the laser field E, may be derived in the usual 
manner. 

These couplings have not yet been observed as stimulated effects. Spontan- 
-eous Raman scattering has, however, been studied from a wide variety of 
excitations. These include localized impurity vibrations, localized impurity 
spin modes, light scattering with the creation of two spin waves, two phonons, 
two rotons in liquid helium II, with a vibrational excitation in solid ortho- 
hydrogen, concentration fluctuations in fluid mixtures, etc. The quantum 
description of these spontaneous scattering processes is outside the scope of 
this review. 


12. TRANSIENT RESPONSE 


The discussion has so far been restricted to steady state type solutions. In 
most cases we have assumed monochromatic waves growing in space. Pure 
temporal growth can be treated in the same manner, as was shown by the fact 
that eqns (30) have the same form as eqns (25). A combined growth in space 
and time requires, however, the solution of coupled partial differential equa- 
tions of the form given by eqns (19) and (20). Such solutions are of practical 
importance, because the powerful laser pump is often a short travelling pulse. 
The pulse duration may be quite short compared to the characteristic damping 
times. Consider for example the case of the stimulated Raman effect, when the 
excitation is by a mode-locked picosecond pulse laser train. The life time of 
the vibrational-rotational state of the molecule will often be longer than 107 1? 
sec. This is certainly so in a gas. One has a kind of shock excitation of each 
individual molecule, but this excitation does not propagate, as the velocity of 
the vibrational wave is small. The stokes light will however travel in syn- 
chronism with the laser pulse, as long as there is no dispersion in the group 
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velocities. We thus have a spatial growth of the stokes light, coupled with a 
pure temporal growth of the vibration. The gain in this situation must be 
derived, not from eqns (30), but from the coupled partial differential equat- 
ions, 


0A, OA, A 
Ss s Ss = * 
0A,* OA, A, 


os are €2A,. (59b) 
The constants ¢, and c, are shorthand notations for the expressions that occur 
on the right hand side of eqns (30). The laser amplitude is considered to be a 
constant parameter. The constants are related to the pure spatial Raman power 
gain constant g, given by eqn (24) as follows; 2c,c,t, = g,v,. Rather general 
solutions of the set of eqns (59) can be found. They have the same form as the 
current and voltage equations on a transmission line. Transient pulses on such 
lines have been studied extensively. Solutions which are applicable to the 
stimulated Brillouin and Raman scattering have been given by Kroll"? and 
by Wang.°?”? 

To minimize the algebraic complexity while retaining the essential features, 
let us assume that the damping t,~* of the light wave and the group velocity 
of the vibrational wave are negligible. The substitution x = z+ v,¢ then trans- 
forms the equations to 


aA, 7 
By oie ‘AL*, 
0A,* A 
v v = A 
Ot + Ty C2 S? 
or 2 
0O°A 1 6A 
ied Seay EAS (60) 


OXOF Ty. OX 27°" 


A,* satisfies the same equation. The damping term may be eliminated by the 
substitution G = A, exp (¢/t,). A special solution of the hyperbolic equation 


0?G 9s 


ne aes C= 
Ox 6t = _2t, : 


is the Bessel function of imaginary argument J [i/(29,xt/t,)]. Solutions satis- 
fying other initial and boundary conditions may be found by the Riemann 


procedure.) 
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Consider the case where the medium, continuously pumped by the constant 
laser field, has a very short stokes pulse as input. Before the pulse arrives at a 
certain volume element, there is no excitation of the molecular vibrational 
coordinate. The initial conditions in this case are 


A,(O, t) = Go(t), (6A,/Ox),20 = 0. (61) 


Here G,(t) is the shape of the pulse at the entrance of the Raman cell. 
The solution satisfying eqns (60) and (61) is 
: OJ {iJ L2g,(t—t’)x 

Acs) = Gol | Gol)expL— (= t)fed PERE OTD gy 
0 t (62) 
For a short rectangular pulse of duration t,, Go(t) = Gp for 0<7<z, and vani- 
shes outside this interval. The cell length satisfies the condition / > v,t, and 
appreciable gain is assumed so that the argument X = ./(2g,t,x/t,)>1. In 
this limit the following asymptotic expression may be used 


8J9 (GX) 


ales —iJ ,(iX) > e*/(2nX)'”. 


The major contribution to the integral (62) will come from values t’~0, 
where the integrand is largest. In this approximation one finds 


A, = Go(4ng,xt/t,)~ "/? exp [(2g,xt/t,)"/? —t/z,]. (63) 


The leading edge of the pulse retains the value Gp but the trailing edge, t=1,, 
reaches its maximum value at the end of the cell x=/. It is seen that the transi- 
ent gain is greatly reduced compared to the steady state value g,/, and increases 
as the square root of the cell length. The solution given by eqn (63) is valid 
only for short times such that the gain dominates the loss in the exponent, 
ie. for t<2g,/t,. The transient regime holds if the pulse duration is short 
compared to the life time of the vibrational state multiplied by the steady state 
gain. For times longer than this, the gain rapidly approaches its steady state 
value as follows from eqn (62) if an upper limit t— oois taken. 

As the transient gain increases, the pulse becomes sharper in the time do- 
main. It reaches half its maximum value at a time ¢t,— At,)2, where 


exp {2g,(t,—Aty/2)/t,}"/? = 4 exp {2g,lt,/t,}"/7 
or 
Atyj, © 210 2(t5/G mas): 


This sharpening in time implies, of course, a broadening of the stokes spect- 
rum in frequency. This is just the opposite from what happens in the pure 
steady state Raman effect, where the line is narrowed since the maximum gain 
occurs at the centre of the resonance. A short input pulse implies a certain 
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frequency spread in the incident signal. As amplification takes place different 
frequency components in the vibrational and stokes waves will beat with each 
other. This leads to a gradual broadening, in a way which has some similarity 
to the self broadening caused by modulation of the intensity dependent index 
of refraction. The transient gain effect may thus lead to very short pulses. The 
limit is set by the dispersion of the group velocities if the laser pump is also a 
short pulse, but if the pump is a quasi-continuous long pulse, the fundamental 
limit appears to be a duration comparable to the inverse of the vibrational 
frequency itself. 

For a short laser pump pulse, there is of course only gain for the stokes 
pulse travelling in synchronism in the forward direction. There is no signifi- 
cant gain for a backward travelling light pulse. The same considerations hold 
for the stimulated Brillouin effect, since the group velocity of the acoustic 
phonons is also negligible compared to the light velocities. Since the acoustic 
phonon damping times are often several orders of magnitude longer than the 
vibrational-rotational life times, the transient regime is first reached for the 
acoustic waves and the gain for Brillouin scattering is correspondingly re- 
duced. Although the steady state gain for the Brillouin process may be larger 
than the steady state gain for the stimulated Raman process, the latter may 
dominate when the laser pulse duration satisfies the condition G,,1,.>t,> 
Tyip- SUCH competition between nonlinear effects has also been observed. 

The deformation in time of a laser pulse propagating in a medium with an 
intensity dependent index of refraction provides another example of a transi- 
ent nonlinear effect. The self-focussing occurring in media with a quadratic 
Kerr effect must be extended to include the combined space-time behaviour. 
Here we shall briefly discuss the development in time of a spatially homo- 
geneous plane wave pulse. Since the index of refraction is highest, when the 
intensity is highest, the maximum of the pulse will travel at a slower velocity 
than the rest. The pulse will thus get a steeper trailing edge, while the leading 
edge becomes more gradual.7*) Before the shape of the envelope is modified 
significantly in this fashion, the phase retardation, which is also a function 
of the intensity, will have an appreciable time dependence. This will lead 
to a characteristic observable frequency broadening of the spectrum of the 
pulse.‘?*:?© Our starting point is again the nonlinear complex amplitude equa- 
tion 

0A CA io 
oot a= Tae de(t)A(z,t). (64) 
The nonlinear polarization corresponding to the quadratic Kerr effect in the 
steady state is given by 


P**(@o) == Xt (—@p, Wo,— Wo, Wo) |Al? = (€,/47) |A|?A. 
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The temporal behaviour of de(t) is governed by the Fokker—Planck equation 
for rotational relaxation discussed in Section 7. It approaches its instantan- 
eous equilibrium value ¢,|A|* with a relaxation time rT, 


O(Ge) 
at 


— ~ (Ge — ell?) (65) 


If the pulse width v,t,<z</ is very short compared to the cell length /, the 
time derivative on the left-hand side of eqn (64) may be neglected in com- 
parison with the spatial derivative. Substitute the solution for de(t) and separate 
eqn (64) into its real and imaginary parts writing the complex amplitude as 
A = |A|exp(—id). The amplitude of the pulse |A(t’)| may be considered 
independent of z, if pulse steepening or pulse distortion due to dispersion of 
the group velocities is negligible. Under these conditions the phase ¢ is given 
by the simple expression 


be.) = “222 [" expt ACPA. (66 


26g.) = 
The frequency components of the pulse having propagated a distance z are 
given by the Fourier transform 


A(q@, Zz) = xf |A(t)| exp [i(w@—@o) t] exp [—id(z, t)] dt. (67) 


The methods of stationary phase applied to this expression show that the 
frequency shift is essentially given by the slope of the phase curve —0¢/6t. 
There are usually two points f, and ¢, on the phase curve with the same slope. 
These points can interfere constructively or destructively. If they have about 
equal amplitudes | A(t,)| and |A(t,)|, the Fourier component will be maximum 
or near zero at some frequencies. If the relaxation time is very short, —0/ét 
is proportional to d|A(t’)|?/dt’ and the broadening would be symmetrical for 
an initially symmetrical pulse. For long relaxation times — 0¢/0t is essentially 
proportional to —|A|* and the broadening should occur mostly on the low 
frequency side. All these features are consistent with detailed numerical cal- 
culations which evaluate | A(q, z)|* from the expressions (66) and (67) and are 
shown in Fig. 5. This typical spectral shape also agrees qualitatively with many 
experimental observations of the broadening of short pulses. The results are 
also consistent with the picture of repeated beating of Fourier components in 
the frequency domain. The effect can be described as a transient modulation 
of the index of refraction. 

Finally, transient effects in thermal Rayleigh scattering should be men- 
tioned. Some characteristic solutions of eqns (47) in Section 9 displaying 
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combined spatial-temporal behaviour have recently been compared with ex- 
perimental results.°*” A remarkable feature is that there is a transient gain for 
zero frequency shift and that the gain initially increases quadratically with 
time. 
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Fig. 5. The initial pulse has a Gaussian shape with a full width at 1/e level given by t, == 
5x10~12sec. The circular frequency wo corresponds to the ruby laser wavelength. The 
relaxation time of the dielectric constant in CS. is taken as t = 2 x 10~!*sec. The frequency 
broadening is calculated at a cell length corresponding to the position before appreciable 
envelope distortion has occurred. The intensity in arbitrary units of the spectrum is calcu- 
lated from eqns (66) and (67), (After Gustafson et al.?). 


Another interesting effect is forward thermal Rayleigh scattering induced by 
picosecond pulses.® It is clear that on this time scale density fluctuations 
have no time to develop. The effect can be understood in terms of eqns (47) as 
follows. The spatial and/or temporal beat of two incident laser pulses creates 
a temperature variation 0,(@, —@,’,k, —k,'). This may be regarded as a steady 
state process, if the distribution of the absorbed power by the dye molecules 
to the surrounding solvent molecules is instantaneous. The density variations 
on this time scale of a few picoseconds are, however, negligible and one may 
take p, = 0, but a light wave at the new combination frequency 2, —@,’, 
and in the new direction 2k, —k,' can be driven by the term proportional to 
(de/0T),0,E, in eqn (47a). Many spatial and frequency side bands can thus 
be coupled together by the thermal Rayleigh scattering on this time scale. 
Even if the absorbed power is not immediately distributed over all degrees of 
freedom in the fluid, the effect would still be present. One should then take 
account specifically of the variation of the dielectric constant with the time 
varying occupation of various excited vibrational and rotational degrees of 
freedom. A number of characteristic constants and relaxation times would be 
needed to describe this in detail instead of the single constant (de/dT)p. The 
experimental results'?®) confirmed that the coupling was proportional to the 
absorption coefficient of the dye solution, and to (ée/éT), characteristic of 
the solvent. Furthermore spatial side bands could be created on a probe beam 
at a frequency which was not absorbed by the dye. This is consistent with the 
picture of an index modulation proportional to (d¢/0T), 0;. 

This cursory review is, of course, far from exhaustive. Its purpose has been 
to show how the extremely rich variety of nonlinear optical phenomena in 
condensed matter can be understood fairly well on the basis of a rather 
simple and unified classical point of view. 
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Optical Pumping and Related Topics 


G. W. SERIES 
I. A PRELIMINARY SURVEY 


1. INTRODUCTION 


We propose to give an account of work which is commonly included under 
the titles: 
(i) optical-radio frequency double resonance experiments, 
(ii) level-crossing experiments, 
(ili) optical pumping experiments. 

For the most part, such experiments have been performed on free atoms, 
and the results that have been obtained are of interest in atomic spectroscopy 
in particular, and in atomic physics more widely, but the methods have also 
been applied to molecules and to ions in solids. 

The experiments have in common that all the techniques are based on the 
interaction of light with the atoms, either in absorption or in emission or both. 
There are close connections with experiments in which electrons are used for 
excitation. 

The light fields we encounter are weak: we shall not be concerned with 
non-linear effects, nor shall we be concerned with the statistical properties of 
the fields. 

Historically, the experiment which introduced this whole field of work was 
the optical radio-frequency double resonance experiment of Brossel and Bit- 
ter.“) This showed how it was possible to apply to excited atoms the tech- 
niques of magnetic resonance which had already been worked out for atoms 
in their ground states (atomic beam resonance methods). 

We shall find it convenient to take the Brossel-Bitter experiment as our 
starting point, then to discuss level-crossing experiments, and then optical 
pumping. This is not the historical development. In the first section we shall 
aim to introduce the whole field, by discussing at length some simple examp- 
les. In later sections we shall take up subtle points and elaborate on techniques 
used in other experiments of the same types. 
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Fic. la. Schematic apparatus for performing the Brossel—Bitter experiment. 


2. THE BROSSEL—BITTER EXPERIMENT 


The Brossel-Bitter experiment measured the Zeeman splitting of the excited 
state 6s6p °P, in mercury by magnetic resonance. The atoms are present in 
the form of a vapour at very low pressure in the horn-shaped vessel and 
irradiated by light from a mercury lamp (Fig. 1a). They undergo the process 
known as resonance fluorescence which can be described (crudely, but well 
enough for our immediate purpose) as the absorption of a photon, followed 
by the emission of a photon. The atoms remain in the excited state for a time 
determined by the probability of spontaneous emission (Einstein coefficient 
A). The mean lifetime is t = A~+. This is the time available for the magnetic 
resonance transition. 

In order to secure an unequal distribution of atoms over excited states 
m (Fig. |b) the incident light is polarized. Magnetic resonance is stimulated by 
an oscillating magnetic field H, cos @ t produced by a current in the coils 
shown in Fig. la. The oscillating field is at right angles to the static field H 
which produces the splitting of levels shown in Fig. 1b. This geometrical ar- 
rangement stimulates transitions Am = +1. 


OPTICAL PUMPING AND RELATED TOPICS 397 


m 
+1 


1S, fo) 


Fic. 1b. Resonance fluorescence in mercury stimulated by 7-polarized light. 


1 


es RH 


Fic. 1c. Change of polarization of fluorescent light resulting from radio-frequency 
transitions. 
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Atoms undergo these transitions: 
(a) if the amplitude H, is sufficiently strong, and 
(b) if the frequency Wg is sufficiently close to the splitting in the field H. 


The resonance is detected by making use of the profound effect which the 
transitions have on the polarization of the emitted light: compare Figs. 1b and 
lc, and notice how, as a result of a transition, the polarization is changed from 
m to a. These have different spatial distributions. A photomultiplier in a fixed 
position registers the change. Thus the absorption or emission of a quantum 
of energy at the radio-frequency determines the polarization of an emitted 
quantum of vastly greater energy at the optical frequency. That is why this 
optical method of detection is so much more efficient, when one comes to con- 
sider questions of signal-to-noise ratio, than a direct measurement of the 
absorption of radio-frequency energy. 

Figure 2 is an oscilloscope display of a Brossel—Bitter resonance. It was 


Main resonance 


Fic. 2. Double resonance signal in the 6°P, state of mercury. 
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obtained by keeping the frequency fixed at about 6 MHz and sweeping the 
magnetic field through the resonance region. It is worth noticing that this 
curve was obtained with very simple methods of detection: wideband am- 
plification of the voltage developed across a resistor in the output of the 
photomultiplier. 

The curve is double-peaked because the transitions Am = +1 are closely 
coupled. It is similar to the resonance curve of two over-coupled resonant 
circuits. The curve becomes single peaked when the radio-frequency field 
strength is reduced. 

We shall shortly show how the shape of such resonance curves can be cal- 
culated. First, we make some remarks on the properties of the oscillating field 
required to stimulate the resonances. Our considerations here are not limited 
to the Brossel—Bitter experiment in particular; we have in mind this type of 
experiment in general. 


2.1 AMPLITUDE OF OSCILLATING FIELD 


It is important to consider what amplitude of radio-frequency field is necessary 
to induce transitions. A classical interpretation of magnetic resonance sug- 
gests that the field should be strong enough for the excited atom to precess 
through an angle of about one radian within its lifetime, t . Recalling that the 
precessional velocity in the field H, is yH,, where y is the gyromagnetic ratio, 
we have for the required condition 


(yH,)t = I rad. (1) 


More formally, we find the factor <J, M|J, + iJ,|J, M + 1> on the left- 
hand side. Using typical values for t (107 ®sec) and a value of y corresponding 
to the Bohr magneton (1-4 MHz/G) we arrive at H, ~ 10G. Actually, this is 
an over-estimate because we can detect changes of angle smaller than 1 rad, 
but it is clear we must think in terms of amplitudes of the order of a few gauss. 
Moreover, in order that the Zeeman levels be non-degenerate, we must think 
in terms of frequencies of the order of 10° rad sec™', i.e., in the range 10-100 
MHz. 

Clearly, in order to drive such a resonance we need high frequency oscil- 
lators with power amplifiers. These requirements stem immediately from the 
fact that we are dealing with excited states. At a later stage, when we come to 
consider resonances in the ground states, we shall find that the longer life- 
times allow the use of fields of the order of milligauss, and that kilocycle 
frequencies are often appropriate. 

The use of high power, high frequency fields in studying resonances in exci- 
ted states often raises technical problems which we shall discuss later. 
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2.2. WIDTH OF RESONANCE CURVES 


An important characteristic of a resonance curve is its width. In the present 
case we can derive a minimum figure for the width by thinking in terms of 
Fourier transforms. The time available for the oscillating field to interact 
with the atom is limited by the mean lifetime of the excited state, hence the 
width of the resonance must be of order t~*. In quantum terms the point to 
notice is that the excited energy levels of Fig. 1b are broadened by radiation 
damping, so that the width of a resonance between a pair of levels cannot be 
less than the sum of their widths, 27 = 217'. 

This is a minimum width. It may be increased by radio-frequency power 
broadening (shortening of the lifetime through transitions induced by the oscil- 
lating field) or by inhomogeneous static fields. The latter is not a serious 
problem for excited state resonances, though it is for ground states. 

Of course, a precise definition of the width of a resonance curve rests upon 
knowledge of the profile. 


2.3. ANALYTICAL EXPRESSION FOR THE LINE PROFILE 


The following is a version of the calculation given by Brossel and Bitter. In the 
interests of simplicity, let us make the system effectively two-level by suppos- 
ing that we are detecting the light through a circular polarizer (o*) in front of 
the photomultiplier. Then we are concerned only with transitions 0 to +1. 

Suppose an atom is excited at time fg. (This is the “‘pulse”’ approximation. 
It is valid if the spectral distribution of the exciting light is much greater than 
the natural width of the excited states. We shall frequently use this approxi- 
mation, and discuss it in a later section). 

Let P(t, to) be the probability that, at time ¢, the atom has made a transi- 
tion. This can be calculated from the theory of magnetic resonance. The result 
is 

P(t, to) = (b7/p”) sin* $p(t— to), (2) 
where 
b=yH,;; p= y(H-Ho)’ +H; Ho = a/?; 


H is the static field and H,, = 2H, cos wy ¢ the oscillating field. (Only one 
rotating component of H, is effective: we are ignoring the counter-rotating 
component). 

The probability that the atom has survived in the interval (t—Zo) is exp 
[—I (t—¢)], where Tis the decay constant for the excited states (the same for 
both). To find the probability of emission of a photon of different polariza- 
tion, eqn (2) must be multiplied by this exponential factor. 

If atoms are excited at the uniform rate R, the number excited in time dfg is 
Rdt,. Hence, finally, if the atoms are excited independently of one another, 
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the steady-state intensity of light reaching the detector is 
t 
I(H) x | (6 /p*) sin? $p(t— to) exp[—T'(t—to)] dt 
0 
oc b?/(p? +I) 


= (H,)’/{T? + (A —- A)? + (H)} (3) 


This is a Lorentzian curve of full width at half-intensity, 2-H ,?+(I/y)*1:. 
As H, tends to 0 the width tends to 2I (in radians sec~*+). The power- 
broadening is thus quantitatively expressed. 

In Brossel and Bitter’s paper the probability P(t, tg) of equation (2) is re- 
placed by the more complicated expression representing a transition out of the 
state 0, leading to a more complicated version of (3). 


2.4. DOUBLE RESONANCE LINE WIDTH COMPARED WITH OPTICAL LINE WIDTH 


The quantitative remarks we made earlier about line widths are borne out by 
these calculations and exemplified by Fig. 2. The line width of a few gauss in 
that figure corresponds to a frequency of the order of 1 MHzand a lifetime of 
order 107 ’sec. This line width should be compared with the line width in 
optical studies of the Zeeman effect—typically, several hundred gauss. The 
Brossel-Bitter experiment therefore allows the measurement of Zeeman in- 
tervals in excited states with a precision orders of magnitude better than 
optical measurements. 

The method is not limited to the measurement of Zeeman intervals. We 
shall indicate in the next section how it has been applied to determine hyper- 
fine structures. 


3. DOUBLE RESONANCE METHOD: HyYPERFINE STRUCTURES 


The basis of the double resonance method is to make use of the changes of 
polarization of fluorescent light which result from magnetic resonance transi- 
tions AM = +1. This selection rule stems from the existence of matrix ele- 
ments of the (x—y) component of the magnetic moment operator 


fi = (6/h)(L+ 28). 


In the Brossel-Bitter experiment the states were the eigenstates |M) of 
j, = £,+28, and fi,y has non-vanishing matrix elements between these states. 
The static magnetic field was used only to remove the degeneracyof the |M>. 
In an atom having hyperfine structure with weak field quantum numbers 
F, Mr, the operator f,, has matrix elements between states |F, M,»> and 
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|F’, My +1), where F’— F = 0, +1. The polarization of the light is determined 
by the value of M,, irrespective of whether levels of different M, are de- 
generate. Hence, the double resonance method may be applied to measure 
hyperfine intervals in zero field (F’- F = +1). It is also, of course, still appli- 
cable in fields strong enough to decouple I and J, and in intermediate fields. 
We shall illustrate this by reference to an experiment of Ritter and Series‘” 
on an excited state of potassium. 


3.1. HYPERFINE RESONANCES IN ZERO FIELD 


The apparatus is shown in Fig. 3. It is, in essentials, the same as that used by 
Brossel and Bitter, but differs in points of experimental technique. Most signi- 
ficant is that the potassium is introduced into the resonance region in the form 
of a crude atomic beam. The use of a hollow cathode light source is also 
noteworthy. 

The excited states of interest here were the second excited P states 5’P3/. 
whose hyperfine structure allows a determination of the nuclear electric quad- 
rupole moment. The level scheme is shown in Fig. 4a. A blue filter eliminates 
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Fic. 3, Double resonance experiment to study the hyperfine structure of 57P3/2 of 
potassium. (After Ritter and Series.) 
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the much stronger fluorescence from the 47P states. Fluorescence from 57P,,> 
passes the filter but its intensity is unchanged by the magnetic resonance un- 
less circular polarizers are used (see later). 

The hyperfine structure of 57P3,. is shown in Fig. 4b. Magnetic resonance 
curves obtained by varying the frequency while keeping the amplitude H, 
fixed are shown in Fig. 5. The interpretation of these curves is not easy without 
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Fic, 4a. Resonance fluorescence of the second excited state in potassium. 
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Fic. 4b. Hyperfine structure of 5?P3/2, showing the double resonance transitions. 
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additional information (based partly on the known h.f.s. of the ground state 
and partly on analysis of double resonance curves in strong magnetic fields). 
With this information one can be confident that the curve at 7-5 MHz repres- 
ents the complex of transitions between the various M, levels belonging to 
F = 3 and F = 2. Notice that the width of the line at half intensity is about 
2 MHz. 
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Fic. 5. Experimental curves corresponding to the transitions marked in Fig. 4b. 
(After Ritter and Series.) 


The resonance curve at 1-4 MHz is obviously narrower and its interpreta- 
tion is not immediately obvious. It is, in fact, identified by its width and posi- 
tion as a double-quantum transition between the levels F = 0 and 2. Such 
transitions are found when the separate resonances 0O—1 and 1—2 overlap 
and the r.f. field is sufficiently strong. We shall discuss these multiple-quan- 
tum resonances later. 


3.2. HYPERFINE STRUCTURE OF RESONANCES IN STRONG FIELD 

‘* Strong field’’ is, of course, a relative term. The fact that 25 G is a strong 
field in this context (decouples I and J) emphasises the smallness of the hyper- 
fine structures which the method is capable of measuring. 
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Fic. 6. Fine structure of the transition 47S, ,.—5?P3;. showing Zeeman components and 
relative transition probabilities. (After Ritter and Series.) 
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With I and J decoupled, both the optical and the radio-frequency inter- 
actions are with the electron, leaving the orientation of the nucleus unchanged. 
The interactions, showing the changes in M,, are indicated in Fig. 6. The 
resonances corresponding to transitions between the four values of M, would 
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Fic. 7. Hyperfine structure of Zeeman components of 5?P3 /2 Showing the radio-frequency 

transitions detectable by the experimental arrangement shown in Fig. 3. The figure has 

been drawn for a value of the electric quadrupole coupling constant 6 approximately equal 
to that of the magnetic dipole constant a. (After Ritter and Series.) 


be coincident were it not for the hyperfine coupling which produces the split- 
tings indicated in Fig. 7. 

The corresponding resonance curve is shown in Fig. 8. The frequency was 
fixed at 41 MHz and the curve obtained by varying the magnetic field. A low- 
frequency modulation was added to the field in order to allow phase-sensitive 
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detection. The curve is therefore the derivative of the true resonance curve. 
The structure is hyperfine structure. Analysis of this curve allowed the deter- 
mination of the hyperfine interaction constants—magnetic dipole and electric 
quadrupole. 

A recent analysis (Pegg?) has shown that the quadrupole constant ob- 
tained from these curves was seriously in error owing to perturbation of the 
resonances by the strong radio-frequency fields. These perturbations will be 
discussed in a later section. 


20 25 G 


Fic. 8. Experimental resonance curve at 41 MHz (derivative curve). The structure represents 
the eight incompletely resolved transitions shown in Fig. 7. (After Ritter and Series.) 


3.3. BACKGROUND SIGNAL 


In Fig. 9 are compared two signals, one obtained with an oscillating field at 
19 MHz and the other with this field turned off. In the first curve the magnetic 
resonance with hyperfine structure is seen, as in Fig. 8, but a background sig- 
nal is present which the second curve shows to be independent of the radio- 
frequency field. There are two contributions to this background signal: 


(i) It occurs in the region where the J-J coupling is being broken down. In 
this region the atomic wave functions are field-dependent, and the optical 
transition probabilities are also field-dependent. (In the early days of reson- 
ance fluorescence, attemps were made to determine nuclear spins by measur- 
ing these changes: Heydenburg et al.) 


(ii) In this region (and also near zero field) there are degeneracies of energy 
levels. This gives rise to interference effects which are now known as level- 
crossing and anti-crossing effects. (The changes of intensity in Fig. 9 are due 
to anti-crossings because z radiation was used to excite the fluorescence). 
These interference effects form the basis of what has proved to be a most 
important spectroscopic technique. They were not recognised when the 
experiments on potassium were performed: the changes of intensity were 
ascribed to the effect (i). Recent studies (Schmieder er al.) of level- 
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crossing effects in these hyperfine states of potassium have led to values of 
the coupling constants free from the error introduced by the strong 
radio-frequency fields. 


The next sections will be devoted to these interference effects. 
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(o} 
Fic. 9. Fluorescence signal in the range 0-15 G. (After Ritter and Series.) 
a. With radio-frequency field at 19 MHz. 
b. With no radio-frequency field. 


4. LEVEL-CROSSINGS 


4.1. THE PHENOMENON 


First, I should like to show the phenomenon and indicate its usefulness, and 
then discuss its interpretation. 

A sketch showing the essential parts of the apparatus, can be seen in Fig. 10. 
There is no radio-frequency field. Notice that the exciting light is linearly 
polarized perpendicular to the magnetic field, and that the fluorescent light 
(consisting in this case of two circularly polarized components emitted along 
the direction of H) is taken through a linear analyser before reaching the photo- 
multiplier. I emphasise that, for the exciting light and for the fluorescent light 
we are using one mode of polarization (linear polarizers), but that two differ- 
ent atomic transitions are involved (AM = +1), both in excitation and in 
decay. 

The experiment is performed by recording the intensity of fluorescent light 
as a function of magnetic field. Figures 11a and 11b show results for fluores- 
cence from an excited state of thallium ((6s76d)7D3 3). In this particular 
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experiment, low-frequency modulation of the magnetic field was used for 
phase-sensitive detection, so the figures show 61/0H. 

In Fig. 11a you see strong changes of the intensity in the neighbourhood of 
zero field. In 11b you see changes at higher fields. These are superimposed on 
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Fic. 10. Geometrical arrangement for studying the level-crossing phenomenon. 
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Fic. 1la. Resonance fluorescence from (6s76d) ?D3/, state of thallium in the vicinity of 
zero magnetic field. (After Gough and Series.) 
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Fic. 11b. The signal extended to higher values of magnetic field. The sensitivity has been 
increased. The broken line shows the tail of the curve of Fig. 11a. (After Gough and Series.) 


the tail of the zero-field curve which is shown by the broken line, and the gain 
has been substantially increased. 

Now look at the energy level diagram, Fig. 12, and notice that energy levels 
having AM = 2 intersect at zero field and at two other places in finite fields. 
The degeneracy of levels is held to be responsible for the changes in intensity 
of the fluorescent light. Experimentally, one measures the value of the magnetic 
field at the centre of the anomaly and calculates the hyperfine constants with the 
help of the Breit-Rabi formula. 

Measurement of the line width is also important. This is generally done for 
the anomaly at zero field. One can easily calculate the mean lifetime of the 
excited state from the measured line width. 
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Fic. 12. Decoupling of hyperfine structure by the magnetic field. The vertical bar marked 
I’ represents the radiation width of the levels. a is the magnetic dipole constant. The signals 
of Fig. 11b are attributed to the intersections of levels marked with circles. 

(After Gough and Series.) 


4.2. INTERPRETATION OF THE PHENOMENON 


It is essentially an interference phenomenon. When the levels are degenerate 
and under the geometrical conditions specified above, the atoms are excited co- 
herently in two modes and by virtue of this, can radiate coherently in two 
modes. The interference results in a redistribution in space of the fluorescent 
light, compared with the distribution when the modes are excited incoherently. 
The decay rate of the atoms is unaffected: there is no change in the intensity 
of the fluorescent radiation integrated over space. Notice that the redistribu- 
tion of light is attributable here to interference whereas in the double resonance 
experiments it was attributable to the radio-frequency field. 

The possibilities for interference are apparent from Fig. 13. The important 
points are: for the atoms—one initial state, one final state, two intermediate 
states ; for the light—coherence in the incident beam and at the detector. 

We can write simple equations based on the notion of pulse excitation which 
we used before. Suppose an atom is excited at time fy by a pulse of polarized 
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light which allows transitions from a common initial state jg> to either of two 
excited states |x> or |B>. The wave functions for |x> and |B> develop ac- 
cording to exp —i(k,;—4iT)(t— lo) (i = «, B), where k; is the eigenfrequency 
and [; the damping constant. In many applications (for example, in cases of 
hyperfine structure), we have I’, = I'y. The state vector at time ¢ for those 
atoms excited at fg is (to within a constant), 


lt,to> = )) Pig exp—i(k,;—HT) (t—t0) ld, (4) 
i B 


i=a, 


where P,, is the (time-independent) matrix element of the electric dipole op- 
erator between the ground and excited state. (This is written as a plausible 
equation: it can be justified in a number of ways.) The intensity of the light 
emitted at time ¢ in such a direction and polarization as to reach the detector 
will be proportional to |< f[P’|t, tp>|?. Thus 
T(t, to) OC [Ppa Pagl” +1P op Pool”) exp —I'(t~to) 

+ P'rq Pag Pas P op exp — i(d—iT)(t— to) 

+ complex conjugate, (5) 
where d = k,—k,. 


Fic. 13. Resonance fluorescence by two paths from initial state g to final state f. 


In a steady-state experiment we integrate, as before, over fy from 0 to t and 
obtain 


Cy? + C,? r 5 
(C; ia 


I es ee 
7 i rye te pP 


(6) 
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Fic. 14. Zero field level-crossing signals (Hanle effect) for 63P, in mercury. The four curves 
correspond to rotation of a polarizer in successive steps of 7/4. (After Kibble and Series.“”) 
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where Cy = etek walt C, = Peg Peal, and A +iB = Py Pag Pas P gp: (7) 


The first term of (6) is a constant. The second and third are Lorentzian bell- 
shaped and dispersion functions of 6. These are the interference terms, which 
vanish when 6.> I. By suitable orientation of polarizer and analyzer one 
can make either A or B vanish. These results are illustrated in Fig. 14,{ which 
shows a set of experimental curves obtained for the zero-field level-crossing in 
6°P, states in mercury—the states studied in the Brossel—Bitter experiment. 


4.3. DETERMINATION OF LIFETIME 


The width at half intensity (the bell-shaped curve) and the distance between 
the peak and the dip (dispersion curve) is equal to 2T’. For the mean lifetime, 
Tt, we have 


t(sec) = I’ ! (radians per sec) ~ 4 (8) 


The method of studying the zero-field level-crossings is, in principle, very 
attractive for the determination of lifetimes because the sole perturbation on 
the atoms is the incident light, which is a negligible effect in comparison with 
spontaneous emission. But in practice there are a number of difficulties: 


(i) admixture of the bell-shaped and dispersion curves, 


(ii) multiple scattering of resonance radiation. This leads to an anomalous 
narrowing of the curves called “‘coherence narrowing”’. 


(iii) distortions caused by non-uniform spectral distribution of the incident 
light. 

We shall discuss these problems in a later section. In practice, the method 

can be applied to measure lifetimes to a precision of 1 % in favourable cases. 


5. ANTICROSSINGS 


There is nothing in the above analysis which requires the levels to be of dif- 
ferent M—the interference terms still exist if the levels have the same value of 
M. But, in time-independent perturbation theory, “levels of the same M never 
cross’. If we are considering the breakdown of I-J coupling, for example, the 
hyperfine interaction al .J, which has non-vanishing matrix elements between 
states of the same value of M = (M,+M,), causes repulsion of such levels. 
Nevertheless, a pair may approach one another to within their natural width, 
and in this case the interference effect is not negligible. Such a situation is 
called an “anticrossing”.“©® 

In the region of an anticrossing the wave functions of the perturbing states 
are changing rapidly. For this reason it is easier to base an analysis on the 
unperturbed states, say |a> and |b>. The optical matrix elements to |a> and 
|b> vary only slowly through the anticrossing. Calculation of the change in 
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intensity of the fluorescent light shows that the interference effect has its great- 
est value when either |a> or |b> alone are populated. The interference signal 
has the analytical form |2V |?/(y? +67 +|2V |”), where |2V| is the interval be- 
tween the levels at their closest approach (a measure of the perturbation en- 
ergy). This expression should bz compared with eqn (3) which describes a 
double resonance curve. The two have exactly the same form. An anticrossing 
signal is the same as a double resonance signal, but in the one case the pertur- 
bation is internal (|2V |), and in the other, external (yH 1). (See Fig. 15) 


————————————— 


OSA 


Fic. 15. An “anti-crossing” of energy levels and the corresponding fluorescence signal 
whose analytical form is given in the text. Here, y is the damping constant. 


It is to be noticed that anticrossing signals are broader than crossing signals, 


and are not suitable for the measurement of lifetimes. On the other hand, they 
can be analysed to yield the hyperfine interaction constants. 


6. GROUND STATES: OPTICAL PUMPING 


We have described methods which allow the study of excited states of atoms. 
To complete our preliminary review we describe some typical experiments in 
which the ground states of atoms are the focus of interest. Therefore we must 
choose as a suitable substance one whose atoms have structure in the ground 
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states. (The even isotopes of mercury, which served as an example before, 
have a ground state without structure.) 

The alkalis have the required structure. Rubidium serves as an excellent 
example because its vapour pressure at ordinary room temperatures is about 
right for the experiments we shall describe. 


6.1. THE PUMPING PROCESS 


Consider an assembly of atoms which have undergone resonance fluorescence 
by anisotropic excitation followed by spontaneous emission, which is an iso- 
tropic perturbation. In principle, we have injected anisotropy into the system; 
that is, we have altered the distribution of atoms over the ground states. We 
have, therefore, prepared the system for magnetic resonance experiments. 
Points of detail will emerge by considering a simple experiment. (See Fig. 16) 
The bulb contains rubidium vapour at a pressure of about 10~° torr. (It 


Rubidium lamp 


Coliimating lens 


Resonance ce: 


R.F. fieic coils 


D.C. and modulstion 
field coils == 7 


Fic. 16. Apparatus for studying optical pumping in rubidium. 
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also contains a noble gas at a pressure of a few torr for a reason we shall 
shortly explain). It is illuminated with light from a rubidium lamp which pas- 
ses through a circular polarizer. The mounting allows a magnetic field to be 
applied in the direction of the incident light, and also enables this direction to 
be aligned with the Earth’s field, since this is by no means to be ignored. 

The optical transitions are indicated in Figs. 17a and b which, for the sake 
of simplicity, ignore hyperfine structure. It is clear that, if the atoms are excit- 
ed by o* light and decay by spontaneous emission without any other pertur- 
bation, the net result of one cycle will be to transfer atoms from the ground 


5°P 3/2 


5*P 4/2 


7947 R 7800 A 


6°52 
D, Do 


Fic. 17a. Resonance fluorescence of the principal doublet in rubidium. 


| 
| 
| 
| 


Do 


Fic. 17b. Transitions induced by excitation with circularly polarized light. The numbers 
indicate relative transition probabilities. 
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state |—4> into |+4). This may be expressed quantitatively. Starting from 
equal populations of the two states, the D, light alone will produce a 20:40 
mixture, while the D, Jight alone will produce a 10:50 mixture. The two effects 
are additive. (We shall reconsider this point when we investigate the effect of 
the noble gas.) 


6.2. RADIO-FREQUENCY RESONANCE 


Having established a difference of population, we can now contemplate a 
resonance experiment. Coils, as shown in Fig. 16, provide an oscillating field 
at right angles to the direction of the light beam. The required field amplitude 
is only of the order of milligauss, since the atoms have lifetimes of the order 
of milliseconds (we discuss this below). The frequency may be of the order of 
kHz if we are studying transitions between Zeeman components, or of the 
order of 10 to 10° MHzif we are studying hyperfine transitions. Signal genera- 
tors will frequently provide enough power. 


6.3 MONITORING THE RESONANCES 
The r.f. field alters the populations of the levels, and this may be detected 

(i) by its effect on the fluorescent light (but this method will not be applic- 

able if the bulb contains a gas), or 

(ii) by its effect on the absorption coefficient of the vapour. 
This latter method has been most commonly used, and is illustrated in Fig. 16. 
The same light which does the pumping also monitors the resonance. Refer- 
ence to Fig. 17 shows that the amount of circularly polarized light absorbed 
depends on the distribution of atoms between the ground states. The resonance 
is indicated by changes in the current of the photocell placed behind the 
resonance vessel. 


6.4. DISCUSSION OF THE RESONANCES 


Fig. 18 is an oscilloscope trace of resonances obtained with apparatus of this 
sort. It is rather a crude picture, but it brings out important points. Notice the 
following: 


(i) stray fields have been compensated to about 2 mG. The only remaining 
field is the longitudinal field shown on the scale; 


(ii) R.f. frequency: 5 kHz. Amplitude: a few mG; 
(iii) there is a peak at zero field which needs to be explained; 


(iv) resonances at + 10-7 mG are due to ®°Rb, at +7-0 mG due to 87Rb; 
(natural abundances 72%, 28%; nuclear spins 5/2, 3/2; g,, 1/3, 1/2; 
respectively); 
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(v) resonance line-widths are ~2 mG (®*Rb)~ 1 kHz, sothat t ~ 107* sec 
(assuming the line is homogeneously broadened); 


(vi) the small wiggles are due to pick-up from the mains, showing that the 
time for traversing the sweep was of the order of seconds. This is 
necessary in order to avoid distorting the resonances, and emphasises 
the relatively large value of t. 


Consider point (iii). Near zero field the energy levels are degenerate or near- 
degenerate. There are stray transverse static fields and also transverse, low- 
frequency oscillating fields from the electrical mains. These can stimulate a 
resonance at zero frequency (or 50 Hz). These stray fields can account for the 
peak at zero field. 

Some conceptual difficulty may still remain. While it is easy to use quantum 
labels M, for non-degenerate levels, and to understand transitions between 
them, can one still use such language when the levels are degenerate? The 
answer is yes, for one can choose a quantum axis arbitrarily, and describe any 
experiment thereby, provided one takes proper account of the perturbations. 
Here, we choose an axis along the light beam and describe the optical transi- 
tions with reference to that axis. We notice that the magnetic field may be 
skew to the axis and we use a formalism which takes account of that, namely, 
magnetic resonance transitions at zero frequency. We could equally well 


ee) 


Fic. 18. Oscillogram demonstrating radio-frequency resonances in the ground state of 
rubidium studied by optical pumping. Resonances at +7mG correspond to ®’7Rb; those 
at +10.7mG to ®*Rb; for other features see text. 
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Fic 19a. Optical pumping signal in rubidium. No r-f field. 


Fig. 19b. Resonance at a frequency of 3 kHz. 
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Fic. 19 c, d. The effect of increasing r-f power: resonances at multiples of the field Ho. 
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describe these transitions by writing equations for rotating the axis from the 
longitudinal to the skew direction. Alternatively, we might choose an axis in 
the skew direction and describe the optical transitions with reference to such 
an axis, but we would have to remember that, on account of the degeneracy, 
there may be coherence between different optical transitions. 

The fields of work opened by resonances such as shown in Fig. 18 are: 


(a) precision measurement of g-values of atoms in ground states, (and, cor- 
respondingly, of hyperfine intervals) ; 

(b) measurement of nuclear spins. (The values of g, may be determined 
from the measurements. Then, if g, is known, I may be deduced); 


(c) precision magnetometry and atomic clocks; 


(d) studies of relaxation processes by measurement of the line width under 
different conditions. We shall elaborate on this. 


6.5. MULTIPLE QUANTUM RESONANCES 


We now illustrate in Fig. 19 an effect which can arise very easily in ground 
state optical pumping experiments, and can lead to mistaken identification of 
resonances. This is the occurrence of what appear to be harmonic resonances. 

The magnetic field sweep is the same as in Fig. 18, but the zero has been 
displaced to the right. The radio-frequency has been reduced from 5 to 3 kHz. 
The figures a, b, c, d are taken with increasing r.f. power, starting from zero. 
Curve (a) is the zero-field resonance. In (b) is seen the resonance of ®°Rb, as 
in Fig. 18 but at a field H, about 6-4 mG. The resonance of ®’Rb is unresolved. 

Increasing r.f. power, Figs. 19 (c) and (d), brings up resonances at approxi- 
mately 2H, and 3H,. (They are not exactly at integral multiples of Hy, but the 
small displacements are not our present concern). These might have been due 
to harmonics in the oscillator, but this possibility was excluded. They are 


Fic. 20. The resonances interpreted as multiple quantum transitions. 
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Fic. 21a. Resonance at 5 kHz produced by rotating field. (The zero field signal is also 
present). 


Fic. 21b. Effect of increasing r-f power. The dominant resonance is undisplaced but 
greatly broadened. Numerous subsidiary resonances appear; the zero field signal is dis- 
placed slightly to the left (see Fig. 42). 
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interpreted as multiple quantum transitions between a pair of levels having 
AM = +1, and require for their interpretation the existence of components of 
the r-.f. field other than the rotating field which is responsible for the primary 
resonance at Hy(o*: AM = +1 in absorption). 

Figure 20 shows how acomponent of the oscillating field parallel to the static 
field (x: AM = 0), combined with o*(AM = +1), can satisfy the conserva- 
tion of energy and of angular momentum at a field 2H, while the counter- 
rotating component o (AM = —1 in absorption) can combine with 20+ to 
satisfy the conservation laws at 3H. 

In Fig. 21a. is a resonance curve taken with an arrangement of coils design- 
ed to produce a rotating field (o* alone). The frequency is now 5 kHz and 
zero field has been shifted to the centre. The zero field resonance is seen as well 
as the resonance at about 10 mG due to the rotating field. This should be 
compared with Fig. 18 where the resonances appear on both sides of zero 
because an oscillating field was used. 

In Fig. 21b. the r.f. field strength has been greatly increased. A variety of 
resonances appear at fields up to 5H on the left and 4H, on the right. These 
are due to small o” and z components arising from the non-uniform field of 
the coils, and acting with the o* component to generate the resonances. 

These multiple quantum transitions should not be confused with the multi- 
ple quantum transitions described above (Section 3.1 and Fig. 4b). In the 
present case we have transitions between atomic eigenstates induced by dif- 
ferent components of the r.f. fields, acting coherently. In the former case we 
had transitions via non-resonant intermediate states of the atom induced by 
one particular component of the field. 

These multiple quantum transitions have been studied by a number of 
authors. !% 


6.6. WIDTH OF RESONANCE CURVES: RELAXATION TIMES 


We have been concerned with the occurrence of resonances. We now return to 
the question of their width. 

One might expect the width to be determined by the inverse of the time of 
flight of atoms across a resonance cell (~10~*sec, for example). In fact, 
resonance curves are frequently studied whose width corresponds to times of 
the order of I sec. This is generally achieved by using wall coatings or buffer 
gases. The important fact is that, in many cases of interest, it is possible for 
atoms to undergo many thousands of collisions either with gas molecules, or 
with the coated wall, without losing their orientation. This is generally true for 
atoms whose ground states are S states (such as the alkalis). Their orientation 
arises from the electron and nuclear spin, but the spherical symmetry of the 
distribution of charge isolates the spins from the effects of collision. 

Substances chosen for wall coatings are chemically inert materials such as 
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paraffin hydrocarbons. Similarly, noble gases at pressures from a few torr up 
to atmospheres are used for buffer gases. 


6.7. PUMPING CYCLE WITH BUFFER GASES 


The use of buffer gases has important consequences for the mechanism of the 
pumping process, since excited states (which will be P states if the ground 
states are S) are not inert under collisions with buffer gases. When the pressure 
of the gas is raised to the point where the time between disorienting collisions 
is comparable with the radiative lifetime of the pumped atoms, the changes of 
population induced by optical pumping are radically changed. The situation is 
usually analysed by assuming that collisions result in the random orientation 
of excited atoms—equal population of the states |M,> or |M,>. This is an 
oversimplification, but it gives useful qualitative results. 

If all excited states are equally populated and decay at the same rate, then 
all the ground states are replenished at the same rate (whatever the coupling 
scheme). It follows that differences of population between ground states can 
only be established through different rates of excitation and this raises a dif- 
ficulty when the ground states are S states. The intensity sum rules guarantee 
that if the absorption from an S state is summed over a multiplet (e.g. S,/. to 
P12 and P3/,), the transition rates from all components of the ground state 
are equal. (This may be verified by adding the transition probabilities in Fig. 
17). Therefore, in order to achieve differences of population one must have 
light whose spectral distribution over the fine structure components is non- 
uniform. This is commonly achieved by the use of interference filters or absorp- 
tion cells. Under some conditions the required change of spectral profile oc- 
curs as the pumping light traverses the vapour in the resonance vessel. 


6.8. HYPERFINE PUMPING 


Hyperfine pumping is another variant of the optical pumping method which 
relies on non-uniform spectral distribution of the pumping light. Suppose the 
hyperfine components in an absorption line are optically resolvable (reson- 
ance lines of rubidium, for example), it is then sometimes possible to find a 
lamp (often an isotope of the same element) whose spectrum covers the hyper- 
fine components unequally (®°Rb (lamp) and ®’Rb (sample)). In such a case, 
light of any polarization will produce an inequality of population between 
hyperfine states of the ground level. The hyperfine intervals in zero field can 
now be studied by magnetic resonance, just as for the excited states. The re- 
laxation mechanisms for hyperfine transitions may be different from those for 
Zeeman transitions. The intensities of the resonances are a measure of <LJ). 


6.9. SPIN EXCHANGE 
We have described how atoms can be oriented by light. An important exten- 
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sion of the technique due to Dehmelt was the discovery that atoms of one 
species can be oriented through collisions with atoms of another species which 
themselves have been oriented by light. Thus, for example, with a mixture of 
sodium and rubidium vapours in a vessel, the sodium can be oriented by il- 
luminating the bulb with rubidium light. Moreover, r.f. resonances in the sod- 
ium may be monitored with the rubidium light because the sodium atoms 
which have interacted with the radio-frequency field themselves interact with 
the rubidium atoms. 

An important application of this spin-exchange technique is in the study 
of substances which cannot themselves be oriented by light, or for which there 
are technical difficulties. Among the substances so studied are atomic hydro- 
gen (and its isotopes) and free electrons. 


6.10. METASTABILITY EXCHANGE 


Closely related to spin-exchange is metastability-exchange, first studied by 
Colegrove et al.) We shall illustrate this and the following sections by refer- 
ence to the experiments of Greenhow.“ 
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Fic. 22, Term diagram of *He showing optical pumping from the metastable state 23S,. 
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Optical pumping is performed from the metastable state 2°S, which is popu- 
lated (with random orientation) in a discharge in helium gas at a pressure of a 
few torr (see Fig. 22), Il[umination with circularly polarized light at 10,830 A 
or 3889 A from a helium lamp produces polarization of the metastable atoms 
through the intermediacy of the excited states 2°P or 3°P respectively. The 
polarization is reflected in the population of the hyperfine states of 27S,, and 
if the J-J coupling is not broken down by an external field, the nuclei will 
also be polarized. When these metastable atoms collide with the much more 
numerous atoms in the ground state, 1'So, there is a probability that the exci- 
tation energy will be transferred without affecting the nuclear polarization. 
Such collisions establish a nuclear polarization in the ground state. 


Experimental curve 
Tg = 2°3 Hz 
325 330 335 
mG 
Fic, 23a. Nuclear resonance at a frequency of 1 kHz in the ground state of 3He (this 
curve was taken with modulated light: see Section III 6.3). 
(After Partridge and Series. Proc, phys. Soc. 88, 983 (1966).) 


Pee enka v/2 H=0 FalV/2 oe 


Fic. 23b. Resonances at 1 MHz in the metastable state. It is to be noticed that the line 
width is nearly two orders of magnitude greater than in (a). 
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It should be appreciated that this nuclear polarization is in a state in which 
the electronic angular momentum is zero: the magnetic moment of the atoms 
resides solely in the nuclear magnetisation. (For the metastable states, the 
electronic angular momentum is non-zero). 

3He atoms in their ground states have extremely long relaxation times. (Ten 
hours has recently been measured by Cohen-Tannoudji et al.'*»). In the gas 
discharge, relaxation times are of the order of minutes. The resonances and 
the relaxation times can be studied by measuring the absorption of light by the 
metastable atoms, though these are not the only methods available. 

Figure 23a shows a nuclear resonance in *He at a frequency of 1 kHz. 
(Actually, this curve was taken with modulated light, but it demonstrates the 
characteristics of the resonance). The g-factor, about 2:3 x 10° °, shows that 
it is a nuclear resonance. 

For comparison, Fig. 23b shows a set of resonances, also monitored by 
absorption of light by the metastable atoms, where the applied frequency is 
1 MHz. The resonances occur at values of the field about } and 4 G respec- 
tively, whereas that in Fig. 23a is at about 4 G. Yet the frequencies in the 
two cases differ by a factor 10°. The resonances in (23b) are Zeeman reson- 
ances in the hyperfine states F = 1/2 and 3/2 of 2°S,, where the g-factors, 8/3 
and 4/3, are of electronic magnitude. 


7. RELAXATION PROCESSES 


In many applications of optical pumping it is of interest to study relaxation 
processes, for this gives information about the interactions between the optic- 
ally pumped material and the collision partners. Methods available are static 
methods (e.g., measurement of resonance line width) and the study of trans- 
ients. The number of time constants required to interpret the measurements 
depends on the complexity of the system and the symmetry of the environ- 
ment. For a spin-} system in an anisotropic environment two times are 
sufficient: the longitudinal (T,) and transverse (7,) relaxation times which are 
familiar from the Bloch equations of magnetic resonance. 

The relaxation times will often be measured in a situation where the optical 
pumping process itself contributes to the relaxation: atoms are suddenly per- 
turbed, either by optical excitation, or by collisions, according to the mechan- 
ism of pumping. The time constants measured will then be functions of the 
mechanism. These times we call t, and t,. They are related to T, and T, by 
equations of the form 
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where T, ... are time constants for the pumping mechanisms, and K,, K,’... 
are numerical constants depending on these mechanisms. 
We give examples of the measurement of t,, T, and 15. 


7.1 RELAXATION WITH THE PUMPING LIGHT: T, 


Interpose a shutter between the pumping lamp and the cell. Suddenly with- 
draw the shutter and record the exponential change of intensity towards its 
equilibrium value (Fig. 24a). The time constant is 7. 


7.2. RELAXATION IN THE DARK: T, 


When equilibrium has been reached, interpose the shutter for a definite time 
(t; —t)). During this time the sample has relaxed in the dark with time con- 
stant T,. On removal of the shutter one obtains again the pumping curve 
(t,), but the initial point gives the polarization after the lapse of time (t, — to). 
From the locus of these points (Fig. 24b) one obtains T,.” 


f a t/r4 


Light off on ——+> t 


Fic. 24a. Exponential growth of optical pumping signal. (Time constant 7). 


on off on 


Fic. 24b. To illustrate the method of obtaining points on the curve for relaxation in the 
dark. (Time constant 7). 
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40 msecs 
Nutation af resonance 
 w@,/ 2a V7 Hz @ = 85 msecs 


Fic, 25. Nutation at resonance of *He in the ground state. 
a. Weak r.f. field. 


40 msecs 
Nufation at resonance 


@,/2T7 =375 Hz @= 64 msecs 


b. Stronger r.f. field. 
The time constants for relaxation can be obtained from the damping of these curves. 
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7.3. MONITORING LAMP 


An alternative procedure is to use separate lamps for pumping and monitoring. 
The latter should be weak enough so that its pumping and relaxing effect is 
negligible. If such a lamp is used the indirect technique described above is 
unnecessary. The relaxation curve (T,) can be observed directly. 


7.4. NUTATION: MEASUREMENT OF 7, 


The transverse relaxation can be studied by observing the nutation as in Fig. 
25. The sample is pumped to equilibrium and the r.f. field, previously adjusted 
to the resonance frequency, is suddenly switched on. The longitudinal mag- 
netization nutates—i.e., precesses round the effective field which is rotating in 
the x—y plane (classical picture of magnetic resonance). This nutation is damp- 
ed at the rate 07 ' = 3(t,~'+7,73). 

Figures 25a and b relate to >He pumped by the process of metastability 
exchange and monitored via the metastable state. The relaxation time con- 
stants in these examples are 64 and 85 msec. The frequency of nutation, 375 Hz 
in the one case and 17 Hz in the other, is, at resonance, proportional to the r.f. 
field strength, and measurement of this frequency is a good method for deter- 
mining this field strength. 


8. FREQUENCY SHIFTS 


Associated with the relaxation processes we have mentioned are dispersive 
effects which may be manifested as frequency shifts. Thus, there is a frequency 
shift which arises from the irradiation of atoms by light, and one which arises 
through collisions between the optically pumped atoms and buffer gases. It 
will be appreciated that such shifts may give rise to serious errors in the preci- 
sion spectroscopy to which optical pumping lends itself. On the other hand, 
the shifts allow a study of interaction processes. 


8.1. PRESSURE SHIFTS 


Figure 26 shows the shift of the zero-field resonance line of 1*3Cs as a func- 
tion of pressure of foreign gas. It will be noticed that both positive and nega- 
tive shifts occur. With an appropriate mixture of gases, the shift may be eli- 
minated. 


8.2. LIGHT SHIFTS 


Figure 27 shows the shift of the Zeeman resonance, M, = 4 — —4, in the 
ground state of '°°Hg. The central curve is the resonance obtained under 
ordinary optical pumping conditions (irradiation with circularly polarized 
*9°Hg resonance radiation). The curves on either side show the displacement 
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obtained by irradiating the vapour in addition with o* or o~ light from a 
different isotope, ?°' Hg. 


Frequency 
MHz 
Ho 
9 192,640 ue 
No 
9 192,635 Ne 
9 192,632 ee 
9 192.630 Pressure 
(@) 5 mm Hg 
Kr 
9 192.625 
Xe 
9192,620 


Fic. 26. Pressure shift of zero field hyperfine resonance line of 133Cs for different buffer 
gases. (After Arditi and Carver.‘?) 


Resonance intensity 
o Without second beam 


1% beam ot 


X Second beam o* 


4 Second beam g_ 


a 
0-5 Hz 
> Ho 


Fic. 27. Resonance curves in the ground state of 1°°Hg demonstrating light shifts. (After 
Cohen-Tannoudji.”) 
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One chooses light from a different isotope because the spectral components 
of 7°*Hg do not coincide with the absorption region of !°°Hg. The signifi- 
cance of this is the following. The dispersion curve which is associated with 


Spectral 
profile of 
lamp p(k) 


ko 


Dispersion 
(sample) 


Fic. 28. The light shift is obtained by folding the spectral profile of the lamp with the 
dispersion curve of the sample (Section I.8.3). 


absorption is antisymmetrical about the mid-point. The displacement of the 
energy level is given by folding the spectrum of the lamp with the dispersion 
curve and integrating over frequencies (Fig. 28). The result is zero if the lamp 
spectrum is symmetrical with respect to the absorption, but non-zero if it is 
asymmetrical. 

The displacements seen in Fig. 27 are not the total displacements of the 
levels, but the differential displacements between the levels M, = 4 and —34. 

It might appear that light shifts could be avoided by using, for irradiation of 
a sample, only light from the corresponding lamp, but this is not necessarily 
true. The spectra of pumping lamps are rarely simple lines which fall symmet- 
rically across the absorption spectrum of the sample. Any structure in the lamp 
which might arise, for example, from hyperfine or isotope effects, is potentially 
a source of light shifts. On the other hand, the shifts themselves are small, and 
of significance only in precision measurements. 


8.3. SEMI-CLASSICAL INTERPRETATION OF LIGHT SHIFTS AND RELAXATION 


A detailed account of the optical pumping cycle has been given by Barrat and 
Cohen-Tannoudji.¢» But the following semiclassical interpretation of light 
shifts and relaxation, due to Pancharatnam‘,'© is helpful. 

The energy acquired by an atom of polarizability « in an electric field E is 
—41E?. This holds also for an oscillatory field. For the complex oscillatory 
field E, exp —ikt we have a, = (e?/2mkg) f/(ko —k —4iT), where ky is a reson- 
ance frequency, I’ the corresponding damping constant, and f the oscillator 
strength. 
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The (complex) energy of interaction with radiation of energy density p(k) is 


AW = —Y uIE,|? = —22 | a, p(k) dk. (9) 
k 0 


If we give this classical expression a quantum interpretation, the real part of 
AW /h gives a level shift 6 and the imaginary part, the damping due to the in- 
cident field. (This is the damping constant for the amplitude: the damping 
constant for the energy, I’, is twice this value). T’ should be distinguished 
from IT, which is the damping constant for spontaneous emission from the 
excited state. 

For the simple case of a transition between states |a> and |b>, we have 


Soa = (2m[he?)ko|P yal” (10) 
and Xba = [Pal /h(ko —k —4iT), (11) 


where P,, is the matrix element of the electric dipole operator. Thus 


12) 
_ = 2n\P sl? (ko ~k) 
6= Se | oO ears oF 
and 
13) 
,_ AmlP yal? ie 40 ( 
Ma Jo OE =? + Gry 


which are essentially the formulae derived by Barrat and Cohen-Tannoudji, 
illustrated in Fig. 28. 


9. LEVEL-CROSSING IN GROUND STATES 


An essential ingredient of the level-crossing effects in excited states (Section 4) 
was the preparation of atoms in superposition states. This is the condition we 
must look for if we wish to study level-crossing effects in ground states. 

It can be achieved in the following way. Suppose we set up an optical pump- 
ing experiment to produce a polarized system. This can be done by illuminating 
a sample with circularly polarized light in zero field (Fig. 29). 

If the system is space-quantized with reference to the direction of the light 
(Ox), there exists a population difference between the states, but there is no 
correlation between them. However, if we space-quantize with respect to an- 
other direction (Oz), each of the new states is to be expressed as a superposi- 
tion of the former states, so that optical pumping along Ox has produced a 
superposition of states defined with reference to Oz. 
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If now a magnetic field is applied along Oz, the states defined with respect to 
Oz are energy eigenstates. They are degenerate in zero field, but the field 
removes the degeneracy. This is precisely the level-crossing situation we met 
in Section 4. 

It remains to monitor the effect of removing the degeneracy. This is done 
by measuring the absorption of the pumping beam in the system. (It may also 
be done by using a separate, weak beam in the Oy direction, or by measuring 
the fluorescent light). One can obtain curves similar to the level-crossing curves 
of Fig. 14, for example, but of width corresponding to the relaxation time in 
the ground state. 


Ge TD 
f 


Fic. 29. Geometrical arrangement for studying the Hanle effect in absorption. 


The situation we have just analysed in terms of superposition states and 
interference effects is sometimes described in terms of the transverse magnetis- 
ation of the system. The pumping beam creates a magnetisation along Ox. If 
a field H exists, the magnetisation precesses around Oz. Since different atoms 
are brought into the field at different times, the net magnetisation is spread 
out like a fan in the x~y plane. Between the extreme situations—(a) H = 0; 
magnetisation along Ox, and (b) H > (yt) *; magnetism uniformly spread in 
the x-y plane—is the transition region which is precisely the region where the 
degenerate Zeeman levels are becoming resolved—the level-crossing region. 


10. DESCRIPTION OF OPTICAL PUMPING IN TERMS OF 
IRREDUCIBLE SPHERICAL TENSOR OPERATORS 


It should be remarked, in conclusion, that a description of the interaction of 
atoms with light in terms of magnetisation, though often extremely illuminat- 
ing, is not capable of representing all the phenomena. It can represent only 
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those phenomena which can be analysed in terms of spherical tensors of 
multipole order 1. Optical interactions by electric dipole transitions can also 
monitor atomic properties described by tensors of multipole order 2, for which 
the appropriate model is a precessing and pulsating ellipsoid. Such a model has 
been described by Pancharatnam. The limitation to order 2 stems from the 
selection rule AM = +1 for electric dipole transitions, and may be under- 
stood with reference to Fig. 13. Tensors of higher multipole can, in principle, 
be monitored by transitions of higher multipole or by dipole transitions with 
more intermediate steps. 

It is becoming increasingly common to analyse optical pumping experiments 
by means of irreducible spherical tensor operators, but no account of this work 
will be given in this chapter.{7: 18 19 


Il. EXPERIMENTAL DETAILS AND POINTS 
OF TECHNIQUE 


We describe here further experimental details and points of technique. 


1. EXPERIMENTAL DETAILS 
1.1. THE SAMPLE 


Two methods are in common use for containg the sample, sealed-off cells and 
crude atomic beams. 


a. Sealed-off cells 


Pyrex or silica cells are commonly used, of linear dimensions between 1 and 
5 cm. Spherical or cuboid shapes are used. Sometimes the cells are extended 
in the form of a horn for a light trap. Good optical quality is not essential. 

The vapour under study is normally at a pressure of the order of 1075 or 
107° torr. 

Details of the application of wall coatings have been given by Bouchiat and 
Brossel.2° 

The vapour pressure of the sample is commonly controlled by the use of a 
side arm and thermostat. Special methods have been used for reactive mater- 
ials. (See, for example, Minguzzi er al.?) 

When foreign gases are used in optical pumping experiments a wide range 
of pressures varying from, say, 10~? torr up to pressures of the order of 1 
atmosphere may be used. (Optical pumping of rubidium in argon buffet gas 
at pressures up to 30 atmospheres has recently been reported.(?) It is impor- 
tant to take account of the broadening and displacement of the absorption 
lines of the sample when high pressures of foreign gas are used.) 
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b. Atomic beams 

The sample is used in the form of a beam when there is reason to fear conta- 
mination in sealed-off cells, or when it is desired to study materials which can- 
not conveniently be maintained as a vapour in quartz vessels. The object of 
the beam is not to secure collimation. Beams have been used in all-glass sys- 
tems, or in metal systems with windows.?» 


1.2. LIGHT SOURCES 


The main requirements are that the light sources should be stable and produce 
a high flux in the spectral absorption region of the sample. In practically all 
cases this has been achieved by using a lamp whose output is a line rather than 
a continuous spectrum. The energy density per unit spectral range is generally 
insufficient in sources with a continuous spectrum. Notwithstanding this, 
optical pumping with white light has been performed under circumstances 
when the absorption line was strongly pressure-broadened,*” and resonance 
fluorescence from metastable states of neon using sunlight is described by 
Wood in his book on Physical Optics.°?*? While certain accidental coinci- 
dences of spectral lines sometimes allow the optical pumping of one element 
by a lamp made from another, it is most common to use in the lamp the same 
element that is to be studied in the sample. Use is frequently made of sepa- 
rated isotopes to secure differential pumping of hyperfine or Zeeman com- 
ponents. Very little use has been made of lasers in this field. 

Insofar as the sample needs to be irradiated by light which can be absorbed 
by atoms in their ground or in metastable states, it follows that the spectral 
lines emitted by lamps of the same element are very commonly self-reversed. 
That is to say, the outer layers of vapour in the lamp may absorb from the 
broadened spectral line emitted from the core of the lamp, just those regions 
in the middle of the spectral profile which are needed in the sample. Much 
effort has gone into the production of lamps free of self-reversal though it 
should be appreciated that a high degree of self-reversal can be tolerated, 
provided there is still sufficient energy at the centre of the spectral line. Thus, 
many forms of commercial sodium lamp show a high degree of self-reversal in 
their spectrum but have nevertheless often been used for optical pumping. 

The disadvantage in irradiating the sample with a large flux of light out- 
side its absorption region (as is the case when a strongly self-reversed lamp is 
used) is that the stray light scattered from the walls of the resonance vessel is 
unnecessarily increased. This is objectionable since the main source of noise in 
optical pumping experiments is normally photoelectric shot noise. 

The different types of lamp which have been successfully used in optical 
pumping experiments include hollow cathode lamps, (internal electrodes) and 
sealed off electrodeless lamps excited by radio-frequency or microwave oscil- 
lators. A discussion of light sources in general has been given by Budick, 
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Novick and Lurio.@* Bell et al.,¢® describe the production of small but ef- 
fective alkali metal lamps. Efficient alkali metal lamps of larger surface area 
have been described by Atkinson, et al.,(?7 Burling and Czajkowski?® and by 
Berdowski et al.'2®) The lamp described in the latter paper is suitable for use 
in high magnetic fields. 

Hollow cathode lamps of advanced design, carrying high current and al- 
lowing the application of high magnetic fields, are being used at Heidelberg by 
Otten and zu Putlitz.6” 

When sealed-off lamps are prepared for excitation by microwave sources 
the cells are normally flattened in order to reduce self-reversal. A typical cap- 
sule much used by the group in Paris is about 2 cm in diameter and 1 mm thick. 
Such capsules are placed near an antenna from a microwave oscillator which 
may typically be a magnetron delivering 100 watts at 2-5 GHz, (such units are 


_Sr@ (a) 


(a) 


Fic. 30. Some typical light sources: sealed-off cells. 


a. Disc excited by antenna from microwave oscillator. 

b. Flattened cylinder forming part of co-axial line. 

c. Cylinder with re-entrant window in coil of oscillator at about 100 MHz. 
d. Cylinder in coil of oscillator at about 10 MHz. 


438 G. W. SERIES 


produced for medical purposes). Alternatively, if the capsules are prepared in 
the form of a squashed cylinder a few centimeters long and a millimeter thick 
they may be incorporated as part of the central conductor in a co-axial line 
driven by a similar oscillator. Capsules used for these purposes will contain a 
noble gas at a pressure of about | torr together with the material whose spec- 
trum is to be excited. It is sometimes necessary to provide a thermostat so that 
the pressure of the vapour may be controlled independently of the current in 
the lamp. 

A microwave-excited lamp suitable for exciting calcium and magnesium 
ions has been described by Smith and Gallagher.@") (See Fig. 31). 


1.3. MAGNETIC FIELDS 


It is rarely necessary in optical pumping experiments to use high magnetic 
fields; the requirements are generally to produce relatively small magnetic 
fields of high homogeneity. The amount of non-homogeneity which can be 
tolerated depends on the line width of the resonance being explored. For zero 
field level-crossing experiments in excited states, for example, where typical 
line widths might be of the order of a few gauss, stray fields in the laboratory 
are often a significant perturbation. For optical pumping experiments in 
ground states it is essential to compensate stray fields. This is commonly done 
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Argon flow (pressure 2-6 torr.) 


Fic. 31. Flow lamp excited by microwave antenna. (After Smith and Gallagher.) 
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with orthogonal pairs of Helmholtz coils of diameter of the order of 30 cm. 
Difficulties in obtaining circular formers of this size may be avoided by wind- 
ing rectangular coils with formers made of channelled material. 

The design of coils and solenoids to produce uniform magnetic fields has 
been discussed by Garrett” and Franzen.°* 

It is possible without too much difficulty to reduce stray static magnetic 
fields to a few milligauss by the use of large Helmholtz coils. Oscillatory 
magnetic fields (at the frequency of the A-C mains) are commonly present in 
the laboratory with amplitude about 2 mG. Attempts to compensate such 
fields by servo-mechanisms have been rather unsuccessful. More commonly 
these stray fields have been reduced by shielding. 

In a recent experiment Dupont-Roc et al.S studied resonances of width 
about | 4G for which the sample was enclosed within four concentric cylin- 
ders of Mu-metal. 


1.4. DETECTORS 

Photocells and photomultipliers are commonly used together with electric 
amplification and recording, although in some of the early work sensitive 
galvanometers were used very successfully. Many investigations have been 
successfully performed with a single photodetector. This is possible only when 
one can rely on the stability of the lamp. 

Compensation for drifts in the lamp can often be effected by taking fluo- 
rescent light into two photodetectors in such positions that the two signals are 
in opposite senses. By the use of an optical bridge or by some other arrange- 
ment, the signal can be doubled and the effects of drifts eliminated to a first 
approximation. 

In simple cases level-crossing and double resonance signals can be a sub- 
stantial fraction of the tctal fluorescent light. In such cases the signal can be 
displayed on an oscilloscope with wide-band amplification. More commonly 
narrow band methods of detection are used to rescue the signal from noise: 
a typica] system would use phase sensitive detection with time constants of a 
few seconds, but in the more complicated cases which are currently being 
studied the signal is a very small fraction of the fluorescent light and signal 
averaging methods are increasingly being used. 

A novel method of detection, applicable to short-lived radioactive elements, 
is described in a later section (II 6.4). 


1.5. FILTERS AND POLARIZERS 

It has been explained that optical pumping is sometimes more advantageously 
carried out by filtering some of the components of the light from the lamp. 
Interference filters are commonly used to separate the alkali doublets, for 
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example. Lyot (polarization) filters are also used for this purpose. Hyperfine 
components can sometimes be suppressed by introducing into the pumping 
beam a cell of a different isotope of the element concerned, to act as a differ- 
ential absorber. 

Sheet polarizers (linear and circular) are often adequate but there are oc- 
casions when more complete polarization is required and in such circumstances 
crystal polarizers are used. While one-beam polarizers (e.g. Glan prisms) are 
attractive, greater aperture can be achieved with two-beam prisms (e.g. 
Rochon). The unwanted beam is focussed and rejected by a stop. 


2. SPECTRUM OF THE LAMP 


Our analysis, in Section I, of double resonance and level-crossing experiments 
was based on the assumption that the spectrum of the lamp was uniform 
across the whole of the absorption region of the sample. There are occasions 
when this approximation is no longer valid, in particular when the absorption 
spectrum is being shifted by the application of magnetic fields. In such cases 
the profile of the resonances may be distorted. This is particularly serious when 
line profiles are being studied with a view to measuring lifetimes. The reson- 
ance line profiles may then be calculated by an analysis based on resolution of 
the spectrum of the lamp into its Fourier components. The signal due to mono- 
chromatic components interacting with stationary atoms is then to be inte- 
grated over the spectrum of the lamp and the Doppler absorption profile of the 
sample.©> 3° 


3. EXCITATION BY ELECTRONS AND OTHER PARTICLES 


There are sometimes advantages to be gained by exciting atoms with electrons 
or other particles rather than by light. So long as some departure from the 
statistical population of excited states can be achieved by such methods of 
excitation, radio-frequency resonance and level-crossing phenomena can be 
monitored by changes in the fluorescent light. 


3.1. ELECTRONS 


Electron excitation has been developed particularly by Pebay-Peyroula and 
his colleagues.°”) An account of recent work in this field is given in the Pro- 
ceedings of a Conference held recently in Grenoble.G® 

A greater variety of excited states can be reached by electron excitation as 
compared with optical excitation, but there are two major difficulties in the 
use of electrons. One is that the fluorescence being detected is often affected 
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by decay from higher levels in cascade. The other is that the electron trajec- 
tories are perturbed by the applied magnetic fields which by the nature of the 
experiments are required to be varied. Very pronounced resonance effects 
occur when the cyclotron frequencies of the electrons in the static field ap- 
proach the frequency of the oscillating field. In order to avoid these effects it 
is arranged, where possible, that the electrons should travel parallel to the static 
field and that the path length should be such that off-axis electrons are unable 
to perform a complete cycle of cyclotron motion in the course of their flight. 

Level-crossing experiments require that the electron beam be transverse to 
the magnetic field (this is a symmetry requirement corresponding to the optical 
arrangement in which the incident light is polarized at right angles to the mag- 
netic field). Here the deflection of the beam by the magnetic field is particu- 
larly serious. The change in deflection as the field is varied gives rise to distor- 
tion of the level-crossing line shape. 

While most experiments with electron excitation have been performed with 
a heated filament as a source of electrons, some recent work has been per- 
formed in sealed-off vessels where the electrons have been generated in a gas 
discharge at low pressure. The vessel is placed between condenser plates at 
the end of Lecher wires. The motion of the electrons under these circum- 
stances is dominantly an oscillation, perpendicular to the plates.°* 


3.2. HEAVY PARTICLES 


The use of heavy particles rather than electrons reduces by many orders of 
magnitude the difficulties connected with the deviation of electrons by the ap- 
plied magnetic fields. Level-crossing experiments in helium excited by H,* 
have been successfully carried out by Kaul.4@” A quantitative study of the 
polarization of fluorescent light from helium excited by protons has been 
made by Krause and Soltysik.4 

Selective excitation can also be achieved in beams of fast particles which 
have traversed a thin foil. 


4, COHERENCE NARROWING, PRESSURE BROADENING AND DEPOLARIZATION 


Our discussions of double resonance and level-crossing experiments have 
hitherto supposed that the signals correspond to the single scattering of light 
by atoms. If the vapour pressure is low enough the experimental results do, 
in fact, correspond to single scattering, but as the vapour pressure is increased 
the light in the resonance vessel may undergo multiple scattering. The first 
consequence of this is that the light becomes depolarized with consequent loss 
of signal. But strong signals may nevertheless still be obtained and it is found 
that, contrary to expectations, the resonance curves and level-crossing curves 
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become narrower with increase of vapour pressure. As the pressure is increased 
still further the curves begin to broaden owing to the reduction in lifetime of 
the excited atoms through collisions. 

The narrowing was discovered by the group in Paris and interpreted by 
them as a consequence of the coherent diffusion of li ght through the vapour. 
The phenomenon is called “‘coherence narrowing”. A detailed experimental 
and theoretical study was made by Barrat.”) It has since been discovered 
that the coherence narrowing is accompanied by a shift in the magnetic reson- 
ance frequencies.“ 434 44) 

It is necessary to be aware of the phenomenon of coherence narrowing when 
experiments are undertaken to determine the lifetime of excited states from 
level-crossing curves. Such curves are taken at a range of vapour pressures and 
the width, plotted as a function of atomic density, extrapolated to zero density. 
The broadening at higher vapour pressures allows a study of interatomic 
interactions. (See, for example, Happer and Saloman"”: Omont and 
Meunier.“!®) 


5. COMPARISON OF DouBLE RESONANCE AND LEVEL-CROSSING EXPERIMENTS: 
MODULATION TECHNIQUES 


While double resonance experiments have been extensively applied in studies 
of excited atoms, they suffer from serious disadvantages which arise from the 
need to use a strong oscillating field on the sample. 


(i) It is not always a simple matter to achieve the necessary amplitude of 
magnetic field at the high frequencies required. 


(ii) The oscillating field ‘perturbs the resonances in two ways; first by 
broadening them and secondly by shifting their positions. (See Section 
IV). 


(iii) The strong oscillating fields sometimes set up gas discharges in the 
resonance vessels and the light from such discharges, though weak, is 
sometimes sufficient to swamp the fluorescent light. 


For these reasons level-crossing experiments are to be preferred. The sole 
external perturbation on the excited atoms is then simply the beam of light 
used for excitation. Level-crossing experiments can give values for lifetimes of 
excited states accurate to 1 or 2% in favourable cases. Measurements to this 
precision demand a careful study of the line profile, which is of the pure 
Lorentzian shape only for certain well-defined geometrical conditions. The use 
of finite light cones may introduce some mixture of the dispersion shape—a 
point which must be carefully studied. 
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Measurements of the position of level-crossing curves yield values for the 
hyperfine interaction constant which are fundamentally limited only by the 
natural width of energy levels. However, calculation of these interaction con- 
stants from the measured magnetic fields at which the level-crossing signals 
are found depends on a knowledge of the g,-factor of the states, and if this is 
not known from optical measurements it must be determined by a double 
resonance experiment. Indeed, the determination of g,-factors is now the most 
important application of the double resonance method. 

An alternative to the double resonance method presents itself in the use of 
modulated light. If a level-crossing experiment is performed with modulated 
light the level-crossing signals are displaced to the position where the modula- 
tion frequency matches the interval between the levels.'*>? From the measured 
fields and frequencies the g-value can be deduced. This is a type of double 
resonance experiment which requires no oscillatory field to be applied to the 
sample. The curves obtained are identical with level-crossing curves—that is 
to say they suffer neither from radio-frequency broadening nor from displace- 
ment of the resonances. The technical difficulties are now transferred to the 
means of modulating the light. Because of these difficulties the method has 
not been widely applied. It has, however, been used to verify the physical 
reality of the frequency diagram—a representation of combinations of the 
nutational and precessional frequencies in the motion of atoms under oscil- 
lating fields (See Section III). The frequency diagram for the excited >P, state 
of mercury has been explored in this way.4°+*”) 


6. SOME TYPICAL RESULTS AND SOME RECENT APPLICATIONS OF OPTICAL 
PUMPING, DOUBLE RESONANCE AND LEVEL-CROSSING TECHNIQUES 


No attempt will be made here to give a complete account of work in this field. 
Some recent compilations have been made by zu Putlitz,¢® by Budick,¢” and 
Major.©® The experiments described in the following sections have been 
chosen to illustrate a variety of applications. 


6.1. HYPERFINE STRUCTURE OF HYDROGEN 


As an example of the application of optical pumping to measure quantities of 
fundamental importance in atomic physics, the series of experiments by Pipkin 
and his colleagues®” on the hyperfine structure of hydrogen, deuterium and 
tritium may be cited. The measurements of the hyperfine intervals were made 
to a precision of a few hertz, the intervals themselves being of the order of 
10° Hz. Knowledge of these intervals is of the greatest importance in verifying 
theories of atomic structure and radiation interactions and in determining the 
value of the fine structure constant. 
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6.2. PRECISION MEASUREMENT OF ATOMIC g-FACTORS 


As an example of more recent precision measurements in optical pumping we 
refer to the work of White et al.©” on the g-factor ratios for free rubidium 
atoms. Precision measurements of such quantities were formerly made by the 
methods of atomic beam resonance spectroscopy. Optical pumping methods 
offer an alternative technique. In the work of White et al. the ratios g,/g; were 
measured for °°Rb and ®’Rb to a few parts per million and the ratios g,(8’Rb)/ 
g,(®°Rb) to a precision of a few parts in 10°. These results were obtained from 
evacuated wall-coated cells. Distortion of the Lorentzian line shape was found 
for cells filled with buffer gases. The results were corrected for shifts arising 
from the pumping light. 


6.3. RELAXATION STUDIES BY OPTICAL PUMPING 


An extended series of experiments on the relaxation of optically pumped 
rubidium has been carried out by Mme Bouchiat. A recent report? gives an 
account of the relaxation of rubidium on paraffin coated walls. The results are 
compatible with other measurements on the interaction of rubidium atoms 
with buffer gases. 


6.4. DETECTION OF OPTICAL PUMPING BY RADIOACTIVE EMISSION 


A variation of the optical pumping method has recently been used to find the 
nuclear spins and moments of certain short-lived radioactive isotopes.©* It is 
applicable to isotopes which can be produced by the bombardment of a gas by 
particles from an accelerator and whose lifetime is short enough for them to 
decay before they have had time to diffuse through the gas to the walls of the 
containing vessel. 

The isotopes produced in this way are polarized by circularly polarized 
light, either directly or through spin-exchange. Their polarization is monitored 
by measuring the asymmetry in the B—emission. The polarization is destroyed 
by a radio-frequency field at resonance with the Zeeman or hyperfine inter- 
vals and the signal is detected by a change in the counting rate. 


6.5. LEVEL-CROSSING EXPERIMENTS ON HYDROGEN 


The fine structure of excited states of atomic hydrogen is no less important 
than the hyperfine structure in atomic physics. By the application of magnetic 
fields of a few thousand gauss the spin-orbit coupling is broken down and 
Zeeman levels of 2P3/2 intersect those of 2P;;2. A measurement of the crossing 
point allows a determination of the P;,. —P3,. interval and of the fine structure 
constant. A level-crossing experiment to determine these quantities was re- 
cently reported by Metcalf et al.©* Similar studies have also been undertaken 
by Fontana and colleagues.*°®) The sharpness of the resonances is less than 
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in the measurement of hyperfine structure because it is limited by the short 
lifetime of the excited P-states. The fine structure constant is determined to a 
precision of about one part in 10°. 


6.6. LEVEL-CROSSING EXPERIMENT ON MANGANESE 


An example of an experiment in which a weak signal from a complicated 
structure was analysed to determine the hyperfine structure is the level-crossing 
experiment of Handrich et al.,” on the MnlI spectrum. The modulation 
necessary for phase sensitive detection was achieved by rotating a polarizer in 
the fluorescent beam. The signal was retrieved from noise by the use of an 
Enhancetron. 


6.7. HYPERFINE STRUCTURE OF EXCITED ATOMS 


As an example of a recent application of double resonance methods to the 
measurement of a hyperfine structure, we cite the work of Feiertag and zu 
Putlitz.°® This study of the 7?P,,. term of **Rb was of value in assessing the 
contribution of core polarization to the magnetic hyperfine structure. 


6.8. MEASUREMENT OF g,-FACTORS OF EXCITED STATES 


The structure of levels of excited atoms is of interest not only from the point 

of view of hyperfine structure but also because measurements of g,-values 

allow one to assess the degree of breakdown of L-S coupling, for example, 

and to determine whether or not configuration interaction needs to be taken 
into account. In this context the double resonance method was used by Ma 

et al.,°° to measure the g-factors of low-lying 3P, states of strontium and 

barium. The g-factors were measured to a precision of a few parts in 10°. 


7. APPLICATION OF OPTICAL METHODS TO MOLECULES AND IONS IN SOLIDS 


We conclude this section with a brief reference to work on systems other than 
atoms. 


7.1. MOLECULES 


A detailed analysis of the possibilities for molecular level-crossing spectro- 
scopy has been given by Zare.©°) Crosley and Zare®°” applied this method to 
nitric oxide. The width of the zero field level-crossing curve for the state investi- 
gated was about 0-4 G, but this cannot be interpreted as a lifetime until the 
g-value is known. A double resonance experiment on the same state allowed 
the determination of the g-factor and hence the lifetime.” 
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In the case of molecules far more than in atoms, accidental coincidences of 
wavelength allow the excitation of the molecule by line radiation from a lamp 
of a different substance. 


7.2. PARAMAGNETIC IONS IN SOLIDS 


As an example of the application to ions in solids, we refer to a recent paper 
by Imbusch et al.°*) on excited states of V?* and Mn** in Al,O;. This work 
allowed the measurement of hyperfine structure and spin-lattice relaxation of 
excited ionic states. References to earlier work in this field are given in the 
paper. More recently it has been shown by Chase* that double resonance 
applied to excited states of ions leads to modulation in the fluorescent light, 
just as in the case of free atoms. 


Il RADIO-FREQUENCY INTERACTIONS 


(a) COHERENCE AND OPTICAL MONITORING 


1. DISTINCTION BETWEEN OPTICAL FIELDS AND RADIO-FREQUENCY FIELDS 


Shere is an important distinction to be drawn between the interaction of atoms 
with fields of extended spectral range (broad-band) on the one hand and of 
sharp frequency on the other. In the context of optical pumping experiments 
the interactions with light (including spontaneous emission) come under the 
former heading and with radio-frequency fields under the latter. For broad- 
band interactions the transition probabilities are independent of time and the 
absorption and emission processes can be represented by exponentials. Such 
processes are called “rate processes’’. In the Einstein treatment of the inter- 
action of radiation with atoms, absorption, stimulated emission and spontan- 
eous emission are treated as rate processes. For monochromatic fields on the 
other hand the transition probabilities are time-dependent, leading to a sinu- 
soidal oscillation of population between two states connected by the perturba- 
tion. This sinusoidal oscillation is formally called nutation, but in the context 
of magnetic resonance one commonly speaks of spin-flipping. It is important 
to recognise the existence of two distinct frequencies in the context of magnetic 
resonance: the driving frequency whose phase is determined by the phase of 
the laboratory oscillator, and the nutational or spin-flip frequency for which 
the phase depends on the moment at which the system is introduced into the 
field and which in steady state experiments is a random variable. 
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It is, of course, possible to obtain the result for fields of extended spectral 
range by integrating over the results for monochromatic fields. It is not diffi- 
cult to show analytically—and, indeed, the conclusion is very plausible—that 
the response of the system is exponential rather than sinusoidal if the ampli- 
tude of the initial state (in the interaction representation) does not change 
appreciably in the coherence time (reciprocal of the spectral range) of the 
exciting radiation. From this result one may deduce the following condition 
for the exponential response: the spectral range of the exciting radiation must 
be greater than the perturbation (measured in frequency units) applied to the 
final state of the transition. This perturbation may be that which arises from 
the exciting field itself or it may be some other purturbation such as an addi- 
tional relaxing process. 


2. MONOCHROMATIC FIELDS: SEMICLASSICAL OR QUANTIZED FIELD 
TREATMENT ? 


In the semiclassical treatment one describes the field classically and the matter 
quantum mechanically. The field variables are treated as ordinary algebraic 
quantities (c-numbers) in the analysis, but operators are used to describe 
dynamical variables of the atoms. 

In the quantum field theory the radiation is quantized as well as the matter 
and the field variables are operators. The state functions are functions both of 
the field and of the matter. 

The important difference between the two approaches lies in the treatment 
of the spontaneous emission. This needs no special consideration in the quant- 
ized field method, but needs supplementary assumptions in the semi-classical 
method. We shall state these assumptions in due course when we need to use 
them. All we need at the moment is to know under what circumstances the 
two treatments are equivalent. 

They are equivalent (if properly formulated) when spontaneous emission 
can be ignored. This is ordinarily true in magnetic resonance or in other situa- 
tions when the field is monochromatic and the mean photon-number is large 
compared with unity, for in this case the probability for spontaneous emission 
is much less than the probabilities for absorption and stimulated emission. 
This is an example of the normal condition under which quantum theory goes 
over into classical theory, namely that the contemplated change in a quantum 
number (in this case the photon-number) is large in comparison with the value 
of that number. But it is worth noticing that the quantized field and semi- 
classical theories are also equivalent for absorption processes treated by first 
order perturbation theory even when the optical fields are weak, (mean 
photon-number per mode much less than unity) for when one is dealing with 
absorption the possibility of spontaneous emission is irrelevant. 
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In the quantized field theory, as normally written, the basis states are 
energy eigenstates of the system (atoms+ optical field), and the radio-fre- 
quency field is treated classically. In a recent formulation, Cohen-Tannoudji 
and Haroche®*) have taken (atoms+ radio-frequency field) as the closed 
system whose energy eigenstates form the busis states. Time-dependence is 
treated by using superposition states. 

In the semiclassical theory, time dependences, both of the optical and radio- 
frequency fields, appear as c-numbers, and it is very easy to follow phase 
factors from the fields to the atomic wave function and back to the fields. 

We shall use the semi-classical treatment in this account. Detailed analyses 
will be restricted to studies of the radio-frequency interactions. We shall avoid 
difficulties connected with spontaneous emission by making use of the “‘pulse 
approximation”, and we shall generally use the pulse approximation also in 
dealing with the absorption of light. 


3. MONOCHROMATIC FIELDS: SEMICLASSICAL TREATMENT 


Two cases can be solved exactly: 
(i) rotating field in the plane perpendicular to a static field. 
(ii) field oscillating parallel to a static field (provided that the atomic coupl- 
ing is not broken). 


The methods of solution are different for these two cases. We shail treat the 

first fairly thoroughly and the second briefly, in Section III.7. We shall be 
thinking of magnetic fields, though much of the work will be applicable to 
electric fields also. 


3.1. A FIELD ROTATING IN THE PLANE PERPENDICULAR TO A STATIC FIELD 


(This is the standard arrangement for magnetic resonance. The problem and 
its solution are well known but we wish to establish a notation and to derive 
some results for reference.) 


Let the field be Hk +H, (cos Wot it+sin wot j), (14) 


where i, j, k form a set of Cartesian unit vectors. Let us write for the Hamil- 
tonian of the system Hy + #1, where HW, includes all time independent and 
isotropic interactions in addition to the interaction with the static field H and 
HH , describes the interaction with the transverse field H,. 
The method of solution is: 

(i) make a transformation to eliminate the time dependence; 


(ii) integrate the equation of motion and write in the time development; 
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(iii) Diagonalize the transformed Hamiltonian; 
(iv) transform back to the initial frame. 


This sequence of operations yields the operator which gives the development 
in time of the state vector or density matrix of the atomic system (time- 
displacement operator). The matrix elements of the operator are easily found 
and the transition probabilities readily obtained. 

Let us study the steps (i) to (iv) in turn and then apply the method to a simple 
case, 


(i) The operator which removes the time dependence is exp (i/h) J, wot 
where J, is the operator representing the component of angular momentum J 
along the direction of the static magnetic field (z-axis). As is well known this 
operator transforms the Hamiltonian #,)+H#, to 


KH! = Ho tH1'-O Jd, (15) 
where H ,’ = exp [(i/h)J, Wy 1] #, exp [—(i/h)J, wo t]. 


We shall apply this transformation first at the particular time t = ty at which 
the atoms begin to experience the field, and secondly at the later time ¢ at 
which we sample the system. For a simple spin system the transformation is 
equivalent to changing the field given by eqn (14) to the effective field 


Here = (H—@o/y)k +H, i (16) 
(y is the gyromagnetic ratio). 


(ii) The equation of motion is 
7) 
int) = Nt), (17) 


where |'t> is the state vector at time ¢ in the transformed frame. Since #’ is 
independent of time the equation integrates to 


['t> = expl — G/A) H(t t9)]I'to>. (18) 


We can most easily apply this equation if #’ is diagonalized. This leads us to 
the next step. 


(ii) Diagonalization of the transformed Hamiltonian is easily accomplished 
for the simple spin system by choosing a representation in which the axis of 
quantization is along Hyg¢,, i.e., by a rotation of Oz through the angle B about 
the x-axis from the direction of H to the direction of H,,,. 

B = tan” *H,/(H —Ho). The operator is the rotation matrix 9” (0, B, 0). 

In a more complicated system where the fields are strong enough to intro- 
duce some decoupling of spins we suppose that the Hamiltonian is diagonal- 
ized by an operator S$." 
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In either case let us suppose that the eigenvalues of the diagonalized Hamil- 
tonian are 4,, that is to say the A; are the eigenvalues of S#’S™' or of 


B(p)H' BD — B). 
We now make use of the relation 
exp —(i/h)#'(t—to) = S~* {exp—(i/A)SH’S 7 (t- to)} S, (19) 


and notice that the time development of #’ has been replaced by the time 
development of S#’S~' whose eigenvalues will appear as the frequencies in 
the periodic motion. These eigenvalues are the nutational frequencies. 


(iv) The final transformation is to apply exp[ —iJ,@of ] which takes us back 
to the laboratory frame at time f. 

By this sequence of operations we have found the form of the time-dis- 
placement operator U(t, ty) defined by 


|t> = U(t,to)|to>. (20) 
We have 
Ult, to) = {exp—(iJfi)J, 9 1} S7! {exp —(i[A)SH'S”*(t—10)} 
x S{exp (i/h)J, Wo to}. (21) 
a. Example 


A particle with spin-}. In the field H alone, let the eigenstates be |+), |—}. 
The energy eigenvalues are +4hw, with w = yH. Probability amplitudes are 
a,,a_. Suppose we are givena, = 1,a_ = Oat time fy. The problemis to find 
the values of a, and a_ at later time f¢. 

The state vector at time fy is (g). Using (20) and (21) we can immediately 
write down 


—)) _ (eae 0 ean a) 
ese = 0 expt timot /\ Si}, Sit 


Ali S 
ae yip(t— to) . ) (22) 
0 exp +4ip(t— fo) 


(ce a aa Naas 0 Vo) 
So Se. 0 exp— iy to / \0 


We have used the values + 4h for the eigenvalues of J, andj, = +4hp for the 
eigenvalues of S#/’S~'. If for this case, we identify S with ® (B), we have 


ela eas (23) 
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Performing the matrix multiplication we find 


( 4 _ (= — iw, t{c? exp —4ip(t—to) +5” expt tip(t— 7) 
a_(t)} \exp++4ias t[sc exp —4ip(t—to)—sc exp + dip(t—to)] 


xX expt+tiagty, (24) 


whence |a..(|? = 1-4 sin?B[—cos p(t—to)], (25) 
la_(t)|? = = 4:sin*B[1 —cos p(t—to)], (25a) 

and 
a,a_* — $sin B{c?[1 —exp—ip(t—t)] —s*[1 —exp + ip(t— to) J} exp— img t. 
(25b) 


tan B = H,/(H—Hp)), where Ho = @o/y. 


Equations (25) and (25a) give the populations of the states. Notice the sinu- 
soidal variation with frequency p. Notice also that the driving frequency @o 
does not appear in these equations. 

Equation (25b) gives the correlation between the states. The driving fre- 
quency appears in this equation as well as the nutational frequency, p. Notice 
that p occurs with the time interval (t—¢)), whereas wp, occurs with the time f. 


b. Frequency diagram 


Let us study the periodic terms which occur in equation (24). Firstly, the com- 
mon factor exp+4im@ofo is-irrelevant. It represents an arbitrary phase factor 
common to both states. Next, we see the factors exp +4i@ot. These are periodic 
functions at frequencies mw ), where m is the space quantum number with 
respect to the direction of the field H. But each of these terms is multiplied 
also by periodic terms with frequencies +4p. The +3 in these factors is the 
space quantum number with reference to the direction of Hg. 

These frequencies and the corresponding quantum labels can be given a 
convenient diagramatic representation. 

Figure 32a shows the energy levels of the states |+}, |— > as a function of 
the magnetic field H. The levels are described by the equations @ = +4yH. 
(It is convenient to speak of the energy at this stage although the equation we 
have written is for the corresponding Bohr frequency). 

Figure 32b shows the energy levels seen in the rotating frame, still plotted 
against the laboratory field H and supposing that the transverse field H, has 
not yet been applied. 

Figure 32c shows the energy levels for a finite value of H,. These are the 
eigenvalues 1, of D(fP)#/’D(—f). They are described by the equation 


p= t4hyHeer = thy[(H Ho)? +Hy7F. (26) 
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But we must remember that the eigenstates are now described with reference 
to the rotated axis. In order to emphasise this the states have been labelled 
@® © These states are superpositions of the original states as follows: 


ID> = cos B/2|+ > +sin B/2|—>, 
|O> = —sin B/2|+>+cos B/2|—). (27) 


Figure 32d shows the energy levels after the transformation back to the 
laboratory frame. The level labelled @ gives rise to two levels displaced by 
+4 , corresponding to the eigenvalues +4 of J, for the states |+), |—). 
Similarly the level labelled © in Fig. 32c gives two levels in Fig. 32d. In Fig. 


+ 
e e (a) 
rT 
+- 
: (b) 
5 . 
0 Wo/y —>H 


Fic. 32a. Energy of spin-+ system as function of magnetic field. 


b. Energy of spin-4 system in rotating frame (angular frequency wo: rotation 
parallel to H). 


32d the interval between the pairs of levels labelled +, @ and —,® or +,06 
and —, © is wo, whereas the interval between a pair labelled +, @ and +, © 
or —, ® and —, Gisp. 

Diagrams such as Fig. 32d are generalizations of ordinary term diagrams. 
An ordinary term diagram is a representation of the energies of different 
modes of a system where the Hamiltonian is time independent. The frequency 
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diagram shows the side bands which appear when a periodic perturbation is 
applied. The levels have the same physical reality as do the energy levels in an 
ordinary term diagram. They may be explored, as we shall see, by suitable 
monitoring of the system. 


® 
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Fic. 32c. Energy in rotating frame: addition of transverse field Hy. 
d. Frequency diagram in laboratory frame. 


In general the nutational frequencies do not appear in steady state experi- 
ments but may be found in pulse or modulation experiments. 


4. APPLICATION TO DOUBLE RESONANCE EXPERIMENTS 


In such experiments the considerations of the previous section apply to the 
excited states. Atoms are excited by broad band irradiation, experience the 
interaction with the magnetic fields, and decay by spontaneous emission of 
light. In a steady state experiment the irradiation with light is continuous and 
one needs to justify the separation of the interaction into the three parts: exci- 
tation, radio-frequency interaction, and spontaneous emission. The justifica- 
tion lies in a comparison of the coherence times of the optical and radio- 
frequency perturbations with the mean lifetime of the excited atoms. For 
broad band irradiation the coherence time of the light is typically 107 *° sec, 


454 G. W. SERIES 


whereas the mean lifetime is typically 107 * sec. Thus, phase continuity under 
irradiation is lost in a time short compared with the mean lifetime and it is a 
good approximation to consider the excitation as a random sequence of pulse 
processes. The coherence time of the radio-frequency interaction, on the other 
hand, is, for a continuous wave oscillator, the period for which the oscillator 
is switched on, so that continuity of phase of the radio-frequency perturba- 
tion is something that cannot be ignored. As for spontaneous emission, it is 
convenient to think of this as the result of a perturbation stimulated by random 
fluctuations having the spectrum of black body radiation at the absolute zero. 
For such a spectrum the coherence time needs careful definition, but it suffices 
here to think of the coherence time as the reciprocal of the frequency which is 
being stimulated. The time we have to consider therefore is of order 107° sec 
which allows spontaneous emission also to be treated as a pulse process. 


Our model is therefore as follows: 
(i) an atom is excited at time fy; 


(ii) its wave function evolves in the excited state under the influence of the 
magnetic fields. It is nearly always true in double resonance experi- 
ments that the wavelength of the radio-frequency field is large compar- 
ed with the size of the experimental vessel, so that for different atoms 
we may ignore the dependence of the phase of the radio-frequency 
field on position and take account of the time variation only; 


(iii) it decays at time ¢ later than tp. We need make no distinction between 
the time of emission of light and the time of arrival at the photo- 
detector since the time of flight is negligibly small compared with the 
lifetime of the excited atoms and with the period of the radio-frequency 
field. However, since the atoms are distributed at random in the experi- 
mental cell, there can be no interference between light from different 
atoms. The net photo-electric signal is given by summing the effects of 
single atoms; 


(iv) to describe a steady state experiment the signal at time ¢ is found by 
summing over random times of excitation fg. That is, if the rate of exci- 
tation is constant, R, the number of atoms excited in the interval df, is 
Rdfy and the net effect is found by integrating over tg. (This approach 
neglects the statistical fluctuations). 


The problem is reduced, therefore, to calculating the light emitted at time ¢ 
by an atom known to have been excited to a particular state (or superposition- 
state) at time fg. The calculation of the last section has enabled us to calculate 
the state function at time ¢ and we need now to write an expression of the in- 
tensity of the emitted light. We shall see in a later section that it is of interest 
also to consider the state vector of the atom after the emission of light. 
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4.1. EMISSION OF LIGHT 


In the semi-classical theory applied to emission from atoms in energy eigen- 
states, the well-known procedure is to suppose that each excited atom will 
radiate like a set of independent oscillators whose dipole moments are given 
by the matrix elements of the electric dipole operator between a specified exci- 
ted state and all lower states which connect with it. We need to generalize this 
procedure for the present situation where the atoms are being subjected to a 
time-dependent perturbation (the r-f field). We must now take dipole matrix 
elements between eigenstates of the perturbed system, and if we express these 
in terms of the energy eigenstates of the system without the radio-frequency 
field we shall find superposition-states and time-dependent phase factors both 
in the excited states and in the ground states. Moreover, the levels concerned 
are those of the frequency diagram rather than the time-independent term 
diagram. But we should notice that the transition amplitudes are very small if 
the system is far from resonance, (e.g. if the angle B is small in eqns (23, 24, 
25) the amplitude a, remains very close to unity and the amplitude a_ is 
always extremely small). 
We generally meet one of the following two cases: 


(i) the ground state is single (J = 0), in which case the question of radio- 
frequency mixing does not arise; 


or (ii) if the ground state has structure the intervals are generally so different 
from the intervals between the excited states that when the radio- 
frequency field is near resonance for the latter, it is far from resonance 
for the former. 


The conclusion is (so long as either (i) or (ii) above is true) that the required 
electric dipole matrix element is between a superposition-state for the excited 
states and an energy eigenstate of the unperturbed system for the lower state. 


a. Example 


Let the two-level system which we formerly discussed now represent the excited 
state of an atom. We shall use mm or m’ to label the two eigenstates. We may use 
the equations of Section III 3.1.a for this case, provided we take account of 
radiative damping. This we do by multiplying the amplitudes a, and a_ of 
the eqn (24) by exp —4I'(t—¢9), supposing that both states decay at the same 
rate. 

Suppose that the atom in its ground state is also a two-level system labelled 
by y, yw’. Then the electric dipole matrix elements which we need are <¢p|P|t), 
where yz can be + or —. The component of the radiation which passes an 
analyser parallel to the unit vector e depends on <ple. P|t>, and the intensity 
of light passing this analyser will be proportional to the modulus squared. 
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This will be the intensity at time ¢ emitted by an atom excited at time fo. We 
thus have 


I(t, to.) = K ik<ule. Ple>|?, (28) 


where K is an unimportant constant. (In the quantum field theory, this equa- 
tion gives the probability of finding a photon at time £). 


Using the expansion 
It) = a,()|+> + a_@I-), (29) 
with a,,a_ given by (24) we have 
I(t, to) = K ¥) <ule. Pim) <m’le* . Plt) an Qn* (30) 
u 


The intensity at time ¢ resulting from excitation at the uniform rate R is 
given by 


t 
I(t) = KRA yp i ant, to) dy*(t, to) exp—T(t— tat, (31) 
0 


= KR Tr(Ao). (31a) 


In writing eqn (31a) we regard A as a matrix whose (m, m’) element is given 
by 
Amm = \ <ule.Plm><m'le*. Plu, (31b) 
‘a 
and o, whose (m, m’) component is given by the integral in eqn (31), as the 


steady state density matrix for the excited atoms. 
The elements of o are: 


1 pe 
O44 = aes ceo (32) 
1 b? 
b 6+iT . 
o,.=0*%, “Or Peap? om? oh (32b) 


where b=yH,, 6=y(H—H)), p= 7[(H—H,)*+H,7]?. 


By suitable choice of the type and orientation of the analyser which trans- 
mits the light, the elements of the matrix A can be given real or imaginary 
values or zero. Thus the intensity of the light will be represented by a sum of 
elements of the matrix o weighted by factors which are under the control of 
the experimenter. The diagonal elements of o will contribute time-independ- 
ent terms, but for those configurations of analyser which yield non-vanishing 
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values for the off-diagonal elements of A, we shall find in the sum terms of the 
form 


bé_\sin bY \sin 
(pole Do t and (725) oe Wo t. (33) 


The intensity of the light will therefore be given as a sum of resonance 
functions of the Lorentzian type with denominators (I? + p*). The width at 
half intensity is 2[(T/y)?+H,7]*. Arising from the off-diagonal elements are 
terms modulated at the driving frequency. The amplitudes of modulation are 
resonance functions of the bell-shaped type, '/(I'*+ 7), or the dispersion- 
shaped type, 6/(I' +p”). It is worth noticing that the terms which arise from 
the diagonal elements of o are proportional to b? whereas those arising from 
the off-diagonal element are proportional to b. 

The resonance functions which we have found are precisely those which 
arise in the Bloch theory of magnetic resonance. Their occurrence here and in 
that theory may be traced to the fact that the signal which is being described 
is to be associated with a multipole of order | in the expansion of the density 
operator; in other words they are characteristic of the polarization of the 
system. If we were to look into the detail of the type of polarizer required to 
bring out the off-diagonal elements of o we should find that of necessity we 
must use a circular or elliptical analyser; a linear analyser would not suffice. 
Similarly, in order to excite the system as we have supposed at time fy into 
the state |+), that is to say, to excite a polarized system, we would have 
required a circular or elliptical polarizer. 

Had we studied a spin-1 system it would have been possible to contemplate 
alignment as well as polarization. Alignment can be generated and monitored 
by linear polarizers and analysers. In this case the resonance functions are 
slightly more complicated. They are given by Dodd and Series.” The moni- 
toring of the density matrix by various means has been discussed by Carver 
and Partridge.©® 


5. MODULATION IN DOUBLE RESONANCE AND OPTICAL 
PUMPING EXPERIMENTS 


5.1 MODULATION IN FLUORESCENT LIGHT 


We have shown how the effect of a radio-frequency field on a system near 
resonance is by no means completely described in terms of its effect on the 
populations of the states. The field injects coherence into the atomic system, 
which may formally be described in terms of the off-diagonal elements of a 
density matrix. We have shown how this coherence might be expected to give 
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rise to modulation in the fluorescent light in a double resonance experiment. 

These expectations have been fully borne out by experiment. Figures 33a 
to d show resonance curves for the amplitude of modulation of light in the 
Brossel-Bitter experiment.°?) Here we are monitoring a spin-1 system, so the 
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Fic. 33. Resonance functions for amplitude of modulation in the Brossel—Bitter experiment. 


a, b. In-phase and quadrature components at the fundamental frequency. The curves are 
derivatives with respect to magnetic field of the amplitude of modulation. Experimental 
curves on the left: theoretical curves on the right. (After Dodd, et al.) 


curves are not the simple Lorentzians but the more complicated functions 
referred to in the last section. In fact, the curves were taken by imposing on 
the system a low frequency modulation and by phase-sensitive detection so 
that the curves are, in fact, derivatives of the resonance functions. 
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Fic. 33. c, d. In-phase and quadrature components at the second harmonic. 
(After Dodd et al.) 


5.2. MODULATION IN ABSORPTION 


Our calculation of the radio-frequency interaction holds also for atoms in the 
ground states (provided an inequality of population has been achieved by 
optical pumping). Just as coherence between the excited states is shown by 
modulation in the fluorescent light, so also one might expect coherence be- 
tween ground states to be shown by modulation in absorption. This too has 
been studied experimentally. Figure 34 shows modulation in absorption from 
the metastable state 27S, in helium.7” 


6. TRANSFER OF COHERENCE 


6.1. TRANSFER OF COHERENCE BY EXCITATION 


Consider now a system in which coherence has been generated by a radio- 
frequency field, for example, the ground states in an optical pumping experi- 
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Fic. 34. Experimental and theoretical resonance curves for modulation in absorption from 
the metastable state 2°, in helium. In-phase and quadrature components at the second 
harmonic. These are to be compared with figures 33c and d. Experimental conditions: no 
polarizer, linear analyser, @o/27 = 0-5 kHz, FY = 26 kHz, 6/T = 90:515. 
(After Partridge and Series.) 

ment as in the last section (2°S, helium). Suppose, for preciseness, we consi- 
der the coherence between the states [+1 > and |—1} and ask what would be 
the effect of illuminating the atom with light polarized parallel to the magnetic 
field (x-light; Am = 0). There exist dipole matrix elements between 2°S, and 
2°P,; the states |+1) of 23S, connect with |+1) of 2?P, respectively. There- 
fore, provided the spectrum of the light spans both these transitions in absorp- 
tion, one might expect that the coherence between |+1> in 27S, would be 
transferred to |+1) in 23P,, and this indeed takes place. (See Fig. 35.) 

We need however, to consider this possibility a little more carefully. While 
it is true that the coherence can be transferred by a single pulse process, it is 
not necessarily the case that such coherence will be transferred in a steady- 
state experiment. We can think out this question by analogy with the theory 
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of electric circuits. The coherence we are trying to inject into the upper state is 
represented by a periodic function at a specified frequency. The upper state 
(described by its density matrix) is to be thought of as a receiver through which 
the coherence is to pass. We know it will pass if the applied frequency matches 
the tuned frequency of the receiver to within its bandwidth. In our analogy the 
tuned frequency of the receiver is given by the interval between the eigenstates 
|+1) selected by the optical transitions. The bandwidth is given by the excited 


+1 


Modulation 
at 2Wwo 
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Fic. 35. Transfer of coherence from ground states to excited states and resulting modula- 
tion of the fluorescent light. The intervals between the levels are at the second harmonic of 
the driving frequency «>. They are not the same as the intervals in a static magnetic field. 


state damping constant. In a practical case we might be trying to inject a 
frequency of some kilohertz or megahertz into excited states separated by an 
interval of a few megahertz but with a damping constant of several hundred 
megahertz. Because the bandwidth is so large the frequency mis-match is 
irrelevant in many actual cases. It is very easy to transfer coherence from the 
ground states to excited states in optical pumping experiments. 

The coherence thus transferred is, of course, shown up in the fluorescent 
light. For this reason it is possible to monitor optical pumping signals both 
for modulated as well as unmodulated components by studying the fluorescent 
light. 
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6.2. TRANSFER OF COHERENCE IN DECAY 


The possibility of transferring coherence exists for spontaneous decay as well 
as for excitation by light. We must consider two significant differences: 


(i) spontaneous decay is an isotropic process whereas absorption can be 
anisotropic; 


(ii) the restrictions imposed by the bandwidth of the receiver are very severe 
in the case when the receiving states are ground states. 


Let us consider the consequences. 


Experimental curves 
I= 203 kHz 


-l0 1) 10 


=10 -—— 


Theoretical curves 
b=05Ir 
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Fic. 36. Demonstration of circulation of coherence in optical pumping in helium. Experi- 
mental recordings compared with theoretical curves of the amplitude of modulation at 4a. 


Experimental conditions: linear polarizer, no analyser, @o/27=-0°5 kHz, [=20°3 kHz. 
b/['=0-:5. (After Partridge and Series.“7™) 


(i) Although spontaneous emission is isotropic we may still resolve the 
perturbation into modes of sharp frequency and specified polarization. The 
integration over frequencies gives the result for a steady-state broad-band 
process which we analysed in the last section. It remains to perform the addi- 
tion over the three orthogonal modes of polarization. This eliminates the pos- 
sibility of transfer of coherence through the agency of different modes, (e.g. a 
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x—transition cannot couple with a o* transition), but there remains the pos- 
sibility for transfer by the same mode acting in both transitions (z with z, 
o* witho* ando witha). 

(ii) Turning now to the bandwidth of the receiving states, this may be of the 
order of a few kilohertz or even a few hertz. Therefore, the frequency to be 
injected must match the interval between the ground state energy levels to 
within that tolerance. The levels we are here concerned with are those appro- 
priate to the system perturbed by the radio-frequency field, namely, those 
which appear in the frequency diagram, rather than the energy levels appro- 
priate to the system in a static field alone. The condition of matching finds math- 
ematical expression in factors such as {T,+i@oL(Hy — y')— (2 — M2’) * 
which govern the probability for transfer of coherence from the (414,, 41’) com- 
ponent of the density matrix to the (#2, #2’) component via absorption fol- 
lowed by spontaneous decay’. For w) > I’,, the probability is very small 
unless [(H4 — 1") — (#2 — H2’)] 1s zero. 

In Fig. 36 are shown resonance curves for the amplitude of modulation at 
4, in the optical pumping experiment in 2°S, of *He we referred to earlier. 
Evidence from the line-shape proves that the signals are due to coherence 
which was injected into the ground state, carried into the excited state and 


m=—1 m=1 


4Wo 


ee 


Fic. 37. Diagram showing the generation of coherence at 4q@» by the following sequence 
of processes: 
(i) coherent absorption of a+ and a7 light, 
(ii) transfer of coherence by z mode of decay, 
(iii) radio-frequency mixing. 
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back to the ground state by absorption followed by spontaneous decay, and 
monitored in the ground state by another absorption process. One of the 
sequences of transitions which contributed to this signal is represented in 
Fig. 37. It was shown that the strength of the signal was indeed frequency- 
dependent as is predicted by denominators of the form indicated in the last 


paragraph. 


6.3. TRANSFER OF COHERENCE BY COLLISION 


We have hitherto been speaking of processes taking place in individual atoms. 
We now consider the possibility of transferring coherence from one atom to 
another in collisions. 

The process of spin exchange which we mentioned in an earlier section 
(I 6.9) may be represented as a transfer of longitudinal polarization by colli- 
sion. The transfer of transverse polarization may be described as the transfer 
of coherence. There is no objection of principle, therefore, to the notion of the 
transfer of coherence by collisions. In fact, it takes place subject to the general 
rule that the frequency mis-match between the driving and receiving systems 
be not greatly in excess of the damping constant of the receiver. 

An example illustrates what, at first sight, might appear to be surprising 
features of the phenomenon. 

In a previous section we discussed the polarization of *He in the ground 
state 11S, by collisions with optically pumped *He atoms in the metastable 
state 2°S,. Magnetic resonance of the *He nuclei is monitored by detecting 
changes in the light absorbed by the metastable atoms. If one studies the ab- 
sorption with the appropriate geometrical arrangement one finds modulation 
at the frequency at which one is driving the nuclear resonance. This is evi- 
dence that coherence injected into ground states has been transferred to meta- 
stable states of the atoms by collisions. The resonance curve (Fig. 23a) has a 
line-width of the order of a few hertz. It is to be noticed that such curves are 
obtained by measuring the absorption of light from atoms whose lifetime is of 
the order of milliseconds. 


7. APPENDIX 
We now return to the second case mentioned in Section III 3. 
7.1. FIELD OSCILLATING SINUSOIDALLY PARALLEL TO A STATIC FIELD 
The interaction Hamiltonian is 


= —p.k(H+4H, cos ft) 
= y J(H+H, cos ft). (34) 


We assume that the field is not strong enough to break down the internal coupl- 
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ing (gyromagnetic ratio y independent of the field). J, is an operator. The 
Hamiltonian is readily diagonalized by choosing the z—axis for space quantiza- 
tion and it is possible to integrate the equation of motion directly, without first 
eliminating the time-dependence. 


Thus: 


, @ 
in \t = #|t> (35) 
yields : 
i> = [exp(—itty ['2¢(¢a1 | [.y = UC 10) lod, (36) 
to 
with é: 
[2eenae = 5.00 -10) + OH INGinft—sinfis)]. GB) 
to 
Use the expansion exp—iaJ,sinft= ) J,(aJ,)exp—irft (38) 
(J, is the Bessel function of order r). 
Then 


Im, t> = {exp —ima(t—tg) | Sy J,(am) exp—irft | 


Rm —~o 


x pd J,(am) exp —isft| }im, to» (39) 


(o=yH; a=yH,/f). 
Rearranging, we have 


Im,t) = ¥ J,(am) J,(am)[exp —i(r—s) ft] Lexp—i(mor+ sf) (t— 01] mod 


The periodic terms in the expansion are: 
(i) the precessional term with frequency (r—s)f, phase-locked to the driv- 
ing field, and 
(ii) the nutational term with frequency (mm + sf) associated with the time 
interval (t— tf). 


For steady state excitation the phase of this nutational term, as before, is 
random. However, under certain conditions this random phase disappears 
from the density matrix, whose (m,m’) component is: 


Omm = >. J,(am) J,(am) J,.(am’) J,(am') exp—i(r—s—r'+s') ft 


xX exp—i(ma+sf—m'oa—s'f) (t—to). (41) 
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The condition for elimination of the random phase is: 


(m—m')o = —(s—s’')f, 
@ = (integer) x f/(m—m’). (42) 


This is simply the condition for the degeneracy of certain levels in the fre- 
quency diagram. Degeneracy occurs when certain integral multiples of the 
driving frequency match multiples of the Larmor precessional frequency. 

The frequency diagram for the case J = 1 is illustrated in Fig. 38. At H = 0 
(@ = 0) the levels m = +1 are degenerate, but there exists a family of such 
degenerate pairs. The oscillating field splits each level into an infinite set label- 
led by integers r. Associated with each such level is a Bessel function whose 
order is the corresponding integer and whose argument is am = myH,/f. The 
degeneracy at zero field is removed by the static field H. 


m 
1 


—1 


—24/2 -#/2 Oo {/2 -2t/2 
w=yH 
Fic. 38. Frequency diagram for system with J/=1: magnetic field oscillating at frequency 
f parallel to static field H. Levels having m=0 are not shown. 

The degeneracies at zero and at non-zero fields given by eqn (42) are a type 
of level-crossing similar to that discussed in a previous section. If account is 
taken of the actual width of excited levels the degeneracy extends over a re- 
gion I. If atoms are excited from some initial state by light of polarization 
such that two different states m, m’ can be excited, we find in the fluorescent 


OPTICAL PUMPING AND RELATED TOPICS 467 


light an interference effect which we may describe as a resonance between the 
driving frequency and the Larmor frequency. These resonances have width I 
and suffer no radio-frequency broadening. The relative intensity of different 
resonances is given by a product of Bessel functions. The radio-frequency 
field strength governs these relative intensities. Resonances of this sort have 
been studied by Geneux and Favre,’ Aleksandrov et al.” and by Chapman 
and Series.’ It will be appreciated that modulation phenomena are associ- 
ated with the resonances. Figure 39 shows an example of resonances in the 
6°P, state of mercury. The curves show the amplitude of modulation of the 
fluorescent light at the second harmonic of the driving frequency which in this 
case was 5 MHz. This is the interval between resonances, and was chosen so 
that the resonances were resolved against their width (2 MHz). The occurrence 
of the outermost resonances indicates that the Bessel function J, was signifi- 
cantly different from zero, which implies that yH, was a substantial fraction 
of 5 MHz. 


Second harmonic lamp resonances (8=0) 


Resonance form: 
Un Urtds) cos (2 ft) 
f=5-OMHz 
[Emission along 
Field Axis | 


Signal strength 


Jy (Uy) 


—da(Uy +5) —UalJg +4) 


~2douo 


Ls-3 = =e fo) +4 +1 +3 
Ls=yHs/f Static field strength 


Fic. 39. Level-crossing in 37P, state of mercury subjected to parallel static and oscillating 
magnetic fields. Excitation by light polarized perpendicular to the fields. 


IV RADIO-FREQUENCY INTERACTIONS 
(b) PERTURBATIONS 


We took as an example in the last section the case of a two-level system inter- 
acting with a rotating field. The resonance functions were symmetrical or anti- 
symmetrical about the field H = Ho, which is the field where the applied fre- 
quency @, exactly matches the interval between energy levels in Fig. 32a, i.e., 
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the field where the levels cross in Fig. 32b. This is true also for multi-level 
systems where the levels are equally spaced as, for example, a spin-system 
having J>4, in a magnetic field. But it is not true for a rotating field inter- 
acting with a system where the levels are unequally spaced as, for example, in 
the case of a coupled-spin system which is being broken down by the magnetic 
field. In such a case the peaks of the resonances are displaced by an amount 
depending on H, and the resonance functions are no longer symmetrical. 
Such displacements are found, not only for the three-level system interacting 
with a rotating field, but also for a two-level system interacting with an oscil- 
lating field applied perpendicular to the static field. We shall consider these 
two cases. 


1. SySTEM OF THREE UNEQUALLY SPACED LEVELS INTERACTING WITH 
ROTATING MAGNETIC FIELD 


Let us first apply the transformation which removes the time dependence and 
consider the energy levels in the rotating frame as a function of wp». Consider 
the example shown in Fig. 40a where the states are quantized with respect to 
the axis of rotation and the energy levels have been displaced from their posi- 
tions in the laboratory frame by map. The transverse field H, is supposed zero. 

In Fig. 40(b) is shown the effect of applying the transverse field. The energy 


(F,Me) 


(2,0) 


———> Wo 
(a) 


Fic. 40. System of three unequally spaced levels. 
a. Energy in rotating frame, as function of @o. 
b. Perturbation introduced by transverse field H1. 


levels are perturbed, and we must recall that the original quantum numbers 
are no longer good quantum numbers. The levels in this diagram are the 
eigenvalues of the transformed Hamiltonian S#’S~' referred to in III 3.1 


(iii). 
1.1. DISPLACEMENT OF THE RESONANCES 
The centres of the resonances we shall take to be at the positions of closest 
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approach of the levels of the diagonalized Hamiltonian, though some argu- 
ment is needed to justify this assumption.‘’ >) These positions do not necessarily 
coincide with the intersections of levels in Fig. 40a. The intersection (0, —1) 
for example is displaced because matrix elements of the perturbation exist 
between |1> and |0> but not between |1> and |—1)>. 

As a first approximation to the correction we apply to the level 0 the pertut- 
bation it experiences from 1, A, 9. The resonance condition is then given in 
terms of A, and the Bohr frequencies Q, as follows: 


Qo = Aio = Q_, + Mo . (43) 
Second order perturbation theory yields 
A = lool? ais 
10 7 . 
i (Q | —@o)—-Qo 


bio is the matrix element of the perturbation. The denominator in (44) is 
the difference between the levels in Fig. 40a. 

Since |b,9|? is proportional to H,? the theory in this approximation pre- 
dicts a displacement of the resonances proportional to H ,”. This correction has 
been widely quoted in the literature (Salwen, for example). It is customary 
to plot the position of the peak of the resonance against H,? and to make a 
linear extrapolation to zero to obtain the unperturbed position. However, the 
approximation on which this procedure is based breaks down if Ajo is com- 
parable with the interval between the interacting levels. It is clear that a better 
solution is obtained by using for the energy denominator in eqn (44), the 
interval between the perturbed levels, Fig. 40b. An analytical solution has 
recently been given by Pegg who finds that in the case studied the displace- 
ment of the resonance was a function of |H,|, not of H,?. 


1.2. MULTIPLE QUANTUM TRANSITIONS 


In the last section we concentrated on resonances corresponding to intersec- 
tions of levels for which Am = +1, since the perturbation due to H, was sup- 
posed to have non-vanishing matrix elements between such states and no 
others. But the perturbation in second order connects levels with Am = +2, 
so that weaker resonances are to be expected for such intersections also. Thus 
a weaker resonance will occur for the intersection (1,—1) in Fig. 40b. The 
characteristics of the resonance may be read from the figure: 


(i) the position at which it occurs (the position of closest approach of the 
levels of the diagonalized Hamiltonian) will be approximately half-way 
between the intersections of (0,—1) and (1, 0) in Fig. 40(a); 


(ii) the width of the resonance will be one-half that of the Am = +1 re- 
sonances because the levels intersect twice as steeply. 
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Such resonances are called multiple-quantum resonances because they cor- 
respond to an exchange of two radio-frequency quanta. Their occurrence in 
double resonance experiments was described in Section I 3.1. 


2. Two LEVEL SYSTEM WITH OSCILLATING FIELD PERPENDICULAR TO STATIC 
FIELD 


This case, though apparently quite different from the preceding case, never- 
theless has much in common with it. 


2.1. DISPLACEMENT OF THE RESONANCES 


Figure 41a shows the energy levels as a function of H in the laboratory 
frame. Resonances at +H, and —H, are induced by the clockwise and coun- 
ter-clockwise rotating fields into which the oscillating field may be resolved. 
Each of these resonances perturbs the other so that their peaks do not fall 
exactly at the field +H ,. The shifts are known as Bloch-Siegert shifts. They 
may be understood by reference to the frequency diagram. (See Fig. 41b). 


—Ho fe) Ho 


SEER cand H 
Fic. 41a. Resonances for spin-} system at positive and negative values of the field. 


Consider first that component of the field which gives rise to the resonance 
at —H . The frequency diagram, Fig. 41b, shows levels in the region of +H 
separated by approximately the interval w,. The equations for the levels are: 


Onn = M(— Wo) +n (+ 9)? + [54 -|7]* 
RY —MOy +n(o+ po) [14+4]b4-|?/(W+@o)7], (45) 
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where m, the quantum number along the static field, takes the values +4 as 
does n, the quantum number referred to H,,;.@ = H, and b, . = 4yH, is the 
matrix element of the perturbation. 

The resonance near Ho is stimulated by the other component of the rotating 
field. This resonance occurs at that value of the field for which the interval 
between Q, , and Q_ _ is equal to wg, i.e. when 


QO = Qy + Q = —@Wo + (Dyes +@o) [1 +43\b, = ?/(@res+@o)7I; 
where Ores = Do — 4b, ~|?/(®,es + @o) = yH ~ é. (46) 
6 is the Bloch—Siegert shift. 
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Fic. 41b. Bloch-Siegert shift illustrated by means of the frequency diagram. 


2.2. MULTIPLE QUANTUM RESONANCES 


Consider now the frequency diagram for the component of the field which 
stimulates the resonance at +H, (Fig. 42). We see the possibility of a reson- 
ance near 3Ho, since here the pair of levels — + and + — (upper and lower, 
respectively) are separated by wo. Since the quantum numbers (laboratory 
frame) are inverted, this resonance needs the counter-rotating component of 
the field. The amplitudes of the interacting levels depend on the first rotating 
field, and are given by b, _/p (eqn (24)). The transition matrix element is pro- 
portional to b,_, so the amplitude of the resonance is proportional to 
|b, _|?b, _, and the transition probability to |b, _|°. 

The position of the resonance is not exactly at 3H but is displaced by an 
amount equal to the Bloch-Siegert shift. 

From Fig. 42 we may construct a further frequency diagram corresponding 
to the resonance near 3H, by transforming to the counter-rotating frame 
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(Fig. 43a) and back to the laboratory frame (Fig. 43b). We now see the 
possibility of a resonance near 5H, induced by the first rotating component. 
The process may be continued indefinitely. 


+@ 


0 Ho 2Ho 3H 


Fic. 42. Possibility of resonance near 3Hp illustrated by means of the frequency diagram. 


Each new diagram contains two new levels. For each successive resonance 
a new factor |b, _|* appears in the transition probability. If the resonances are 
studied with increasing values of b, — (proportional to H,) successive reson- 


ances appear in turn. The diagrams represent a treatment by successive approx- 
imation. 
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Fic. 43a. The frequency diagram of Fig. 42 transformed to the counter-rotating frame. 
The perturbation due to the transverse field H, (counter-rotating component) is shown. 
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A formal treatment which is equivalent to the foregoing approach has been 
given by Shirley ‘’” and has been adapted by Pegg‘’®) in the work previously 
quoted. 


+@O® 


-O® 


+O® 


Fic. 43b. Frequency diagram in the laboratory frame representing the effects of rotating 
field and counter-rotating field in first approximation. The diagram shows the possibility 
of a resonance near 5Hp. 


3. Two LEVEL SysTtEM WITH OSCILLATING FIELD: HANLE EFFECT 


Figures 41b and 42 show frequency diagrams for the two rotating fields 
separately, and in each case they are valid in the region H = 0. In Fig. 43b, 
representing the case of both fields acting together, the levels near H = 0 have 
not been drawn since, in this region, the perturbations due to both fields are 
comparable in strength and neither may be regarded as a perturbation on the 
other. The symmetry of the situation argues for a diagram which is symmetri- 
cal about H = 0, whereas both 41b and 42 are asymmetrical. 

The situation near H = 0 may be analysed by regarding the static field as a 
perturbation on the oscillating field, treating the latter by the method given in 
Section III 7. 

Consider the oscillating field H, cos ft, together with the small, static trans- 
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verse field H acting on our two-level atomic system (Fig. 44). Choose an axis 
of quantization parallel to the oscillating field (Oz). Let Ox be the direction of 
the static field. 


Zz 


ex 


H, cos ft 


Fic. 44. Hanle effect with an oscillating field. The interaction with the transverse field H 
is regarded as a small perturbation on the interaction with the oscillating field H, cos ft. 


The Hamiltonian is 
KH = (w,cosft)J,+oJ, 
=Ho+ KH. (47) 


The equation of motion under #, above may be solved exactly to find the 
states |t, to, -+ > which develop at time z from the states |+ > at typ. We have 


lt, fo,+> = }) J,Ga) exp —irft ¥. J,(4a) exp isfty | +) (48) 
lt,t9,-> = y J. (4a) exp ir'ft y J,(Ga) exp —is'ft, |—>. (49) 

The following is also a solution: 
It, fo, -—>* = Ds J(4a) exp ir'ft d Jy(4a) exp is’fto |—). (50) 

This state, which develops from 
Ifo, to, —>* = yy J,(@) exp igfty |—)>. (31) 


at time fo differs from [t, t),— > by an arbitrary phase factor. We shall find 
[t, £9,— >* more convenient than |t, fj, — > because its phase is more conveni- 
ently related to that of |t, tp, +> than is the phase of |f, fp, —). 

Notice that linear superpositions of the states we have written down are also 
solutions of the equation of motion under Ho. 

The complete problem requires the addition of #, to #5, so we try to find 
linear superpositions which are also eigenstates of #7. 


OPTICAL PUMPING AND RELATED TOPICS 475 


We notice that there are degeneracies in the t-dependence of |t, tp,+ > and 
|t, fg, —>*, namely, when r’ = —r. 


Now consider the superposition states: 
lt; fo. >* = C1/V2) [lt to. +> + It, to, —>*] 


= (I\/2) ¥, J,Gba) exp isfte | Jy (da) exp —irft|+> 


+ ¥ J,(a) exp w'f—)|, (52) 
For those terms for which r’ = —r we can take out the common factor 
J,(4a) exp —irft, using J,(4a) = (—1)’J_,Ga). 


We obtain 


tort * = (V2) E JuGba) exp isto] D J,(bad exp —irft 


x [+> +1->] + JGaexp ~ift 1+) = 1=>]] +. 
r odd (53) 
and similarly for |t, tp, —>*. 


The states (1/./2) [|+> +]—)] are eigenstates of #, with eigenvalues 
+4ho. Thus the states |f, tg, + >* have the following properties: 


(i) they satisfy the wave equation under #,; 


(ii) with neglect of terms r’ # —r, the even and odd parts of the wave 
function are separately eigenfunctions of #, for all t, since 


H [+> + |-d]/V2 = tho [|+> + |->]/V2. 


Thus the states |f, fg, + >* are convenient zero-order states on which to base a 
small perturbation #,. 

Let us calculate the matrix of #, in this basis. For the diagonal elements 
we have: 


*<t, to, £111, to, + >* 
= tho >, J,(ga) exp — irft » J,.(4a) exp —ir'ft| 
x >. J,(4a) exp isftp ¥) J,(4a) exp — is'fte| +¢.¢. 
= +hho ¥. J,(a) exp —ipft + c.c. 
Pp 


= +tho ¥ J,(a) cos pft (54) 
P 
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(We have used a Bessel function contraction theorem, with p = r+r’; by the 
same theorem the second square bracket is identically unity.) 


= +tha E o(a)+ } 2J,(a) cos oft] (54a) 
p=2 
even integers 


For the off-diagonal elements, 


*<t, to, FH |b, to, >” = £ tho YJ, (a) exp —ipft —c.c. 
Pp 


= +tiho }) J, (a) sin pft 


= +thiho | >, 2J,(a) sin pft | (54b) 
odd face 

Thus, for the time-independent component, and for the even harmonics, 

the matrix is diagonal. For the odd harmonics the matrix is diagonalized by a 

rotation of z/2 about Oz. The diagonalization removes the factor i froin (54b). 

If the oscillating field were absent (H, = 0), the matrix would be diagonal 

with elements Ey = +4hq. In the presence of the oscillating field we find 
diagonal elements, 


Ep |J0(@ + } 2J,(a) cos mi , (55) 
husmoaics 
and off-diagonal elements 
Eo >, 2J,(a) sin oft| . (56) 
odd 
harmonics 


Thus the oscillating field has the effect of modifying the g-value of the spin 
system. Measured in the x-direction the g-value is predicted to be 
a= Go| Jo(a) + ¥ 25,(a) 00s aft], 


even 


harmonics (57) 
and in the y-direction, 


Iy = Go x 2J,(a) sin of , (58) 
harmonics 

In the z-direction the g-value remains unchanged. 

The results have been obtained by finding the matrix of the perturbation 
# , in a representation based on the eigenstates of #9, and are valid only for 
ao <f. A more elegant treatment has recently been given by Pegg and 
Series.77” 

The time-independent part of this result was obtained by Cohen-Tannoudji 
and Haroche,‘’® and has been verified experimentally by them. Their deriva- 
tion was based on the notion of “dressed atoms”—they derived the stationary 
states of a closed system consisting of (atoms + radio-frequency field), and 
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used the coherent state formalism of Glauber” to form superposition-states 
representing the oscillating field. The experimental study was by means of a 
level-crossing experiment for ground states. 


V. REFRACTIVE INDEX AS A MONITOR OF 
OPTICAL PUMPING 


1. INTERACTION OF LIGHT WITH INDIVIDUAL ATOMS 


Our discussion of the interaction between light and atoms has been based 
on the interaction with individual atoms. Thus, in III, eqn (31a), we derive an 
expression for the intensity of fluorescent light: 


I(t) oc Tr(Ao), (59) 
where A is a matrix whose (m, m’) element is 
Amm = Y,<Hle. Plm><m'|e* . P| n> (60) 
Bp 


and a is the density matrix of the assembly. 

Equation (59) is obtained by summing the intensities of light from the dif- 
ferent atoms in the assembly. 

Equations similar to (59) have been used also for absorption. 

We should not lose sight of the fact that contributions to the intensity, either 
in emission or in absorption, may arise from the off-diagonal elements of the 
density matrix as well as from the diagonal elements. 


2. COHERENCE BETWEEN RADIATION FROM DIFFERENT ATOMS 


The basis of eqn (59) must be re-examined when there is a possibility of inter- 
ference between the light from different atoms. Such a possibility arises in the 
case of forward-scattering. The classical theory of refractive index interprets 
the dispersion of velocity of light in a medium as due to interference in forward 
scattering. Moreover, absorption is intimately related to dispersion and we 
monitor optical pumping by measuring the amount of light absorbed when a 
beam of light traverses the gas. We need to consider therefore, whether a 
theory of absorption based on the propagation of light through a medium 
(theory of dispersion) would lead us back to eqn (59), and whether this ap- 
proach might not disclose other topics of interest. We shall see that the notion 
of an optically pumped vapour as a macroscopic medium having an aniso- 
tropic, time-dependent, complex refractive index is indeed useful. 


2.1 ABSORPTON, REFRACTIVE INDEX, ELECTRIC POLARIZABILITY 


The absorption coefficient of a medium is related to the imaginary part of the 
refractive index n™, This may be obtained from the bulk polarization and this, 


478 G. W. SERIES 


in turn, from the polarizability of individual atoms. We start from an expres- 
sion for the polarizability tensor, «/) of the jth atom, where r and s refer to 
the base vectors. 


e4, = FCGti)/2?, eo =k. (61) 
(i, j, k, are Cartesian unit vectors). 
We have: 
. > A,s( U's 13M) 
Ok, t= a(t) a, * (8)... 62 
where A,(u’, Hs m) = <p'leX . P|lm><mle,. P|u>, 


k is the frequency of the incident light, m, m’ ... are excited states yu, p’ ... are 
- ground states, k,,, k, are Bohr frequencies, I,, is a damping constant. (Details 
of the calculation are to be found in Series.°°) 

The bulk polarizability tensor, «,,, is calculated by summing (62) over the 
N atoms per unit volume. These have a distribution of resonance frequencies 
Kmy = Km—k, given by 


dN = (N/An*) exp [— {(6km,)7/A7}]dk my » 
where A is a measure of the Doppler width of the distribution, and 
Ku = Kmyp—(Kmyo > With (k,,,)o the peak. 
In making the sum we replace a,(t) a*,,(t) by its value o,,,(t) averaged 


over atoms in ground states. The result is 


cele oo ea 


where Z(x+iy) is the plasma dispersion function 
: - 7 exp (—17) k—(Ky, Jo ne 
Z(x+i = «+f ———— dt 5) Xm, = ——;; = ; 
Cry) wo t—(x+iy) " A oo OR (64) 


(We have ignored the contribution to refractive index which arises from excit- 
ed atoms since, in optical pumping experiments, the bulk of the population is 
in ground states.) 


Ony(t) ZXmy tity), --- (63) 


a. Absorption 
The absorption coefficient, y, is related to the bulk polarizability, «, by 
Mrs = (4nk/c)ay? (65) 
where a is the imaginary part of w. The imaginary part of « arises from the 
imaginary part of Z (eqn (63)). 
Now, in exploring a resonance between ground states we are concerned 
with changes in the density matrix o,,,, which occur over relatively small varia- 
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tions of magnetic field, in the course of which Z remains practically constant. 
(The variation of Z is given by the variation of x,,,, with magnetic field, so we 
have to compare changes in the Zeeman splitting of the ground states, ok,,, 
with the Doppler width, A). 

{f we can treat Z as constant, the absorption coefficient is proportional to 


2. A(u, #3 mM) Oyy(0)- 


This is equivalent to eqns (59) and (60) (with m, m’ and y, yp’ interchanged, 
because these equations refer to emission from excited states). 

We have shown, therefore, that an analysis based on the classical theory of 
the propagation of light in dielectrics gives the same result for absorption, in 
optically thin vapours, as an analysis based on the interaction of light with 
individual particles. 


b. Dispersion 


We may expect that this approach would be rewarding if we considered the 
region away from absorption. We should then study the real part of eqn (63). 
The equation predicts that manifestations of optical pumping in ground states 
(resonances, modulation effects, etc.) could be studied by any technique which 
measured refractive index, since the optical pumping effects are described 
analytically by the density matrix, which is linearly related to the refractive 
index. Many such studies have indeed been made. The Faraday effect and 
magnetic birefringence, which measure various combinations of the tensor 
components of the anisotropic refractive index, have proved particularly 
useful. 

For these dispersion studies it is advantageous to use monitoring light whose 
spectrum is displaced in frequency from the absorption region of the medium. 
This light does not then contribute to the optical pumping, and the effect of 
the pumping beam itself can be studied free from perturbations due to the 
monitoring. 


2.2. EMISSION: RESONANCES IN EXCITED STATES 


Our discussion has been based on the constancy of the function Z over the 
region of magnetic resonance. While it is true that this is constant to a high 
degree of approximation for ground state resonances, Z is not constant for 
excited state resonances. These resonances appear in eqn (63) as changes in 
the value of the function Z itself, since it is only through changes in (Km,)o 
that the effect of the magnetic field on excited states shows itself. ((Kimy)o iS 
simply the frequency of the (u, m) Zeeman component of the absorption line). 
Now, changes of (kK»,)o are significant only if they are comparable with the 
Doppler width. A detailed study of the consequences of this shows that curves 
of magnetic resonance in excited states, as explored by techniques which 
measure refractive index, are Doppler-broadened. This is in sharp contrast 
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with such curves as ordinarily studied (e.g. in the Brossel—Bitter experiment), 
where the great advantage of the technique is the elimination of Doppler 
broadening. 

The difference between our present analysis and the earlier study of the 
Brossel—Bitter experiment is that we are now, for the first time, taking account 
of the possibility of coherence between light radiated by different atoms. In the 
Brossel—Bitter experiment, as in conventional experiments on resonance fluo- 
rescence, one studies the laterally-scattered light where there is no such co- 
herence. In our present analysis of refractive index the coherence in forward- 
scattering makes itself felt by Doppler-broadening of the resonance curves. 

There remains a further important point to be made in connection with 
forward-scattering. In optically thick vapours the transmitted light suffers 
multiple scattering and its intensity is not linearly related to the refractive 
index. A consequence of the multiple scattering is that the magnetic resonance 
curves experience coherence narrowing to a degree far in excess of coherence 
narrowing in the laterally scattered light. Indeed, zero-field level-crossing 
curves in sodium have been obtained by Hackett‘*!) which are actually nar- 
rower than curves obtained in lateral scattering. The analysis of this situation 
is complicated, and we shall not pursue it here. 

The important points are: 
(i) the result for emission is very different from that for absorption; and 


(ii) coherence narrowing of excited state resonances is very marked in 
forward scattering. 
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Coherence in Spontaneous Emission 


H. HAKEN, R. HUBNER AND K. ZEILE 


In this paper we present a problem which was treated a couple of years ago 
in the lectures on quantum field theory given by Haken. The problem is quite 
simple. However, we present it here, because it shows how a fully quantum 
mechanical formulation of the field can be used to describe the coherence 
properties of the field in such a way as to reveal a close analogy with the 
classical treatment. 

We begin by quickly repeating the Wigner—Weisskopf theory of spon- 
taneous emission. We adopt the same notation as used in Section 4 of the 
chapter by Haken in this volume and split the total Hamiltonian which des- 
cribes the single two-level atom, the field modes, and the interaction into 
two parts: 


H = Hy+H,, (1) 
where 
Ho =hva,'a,+ Yi ho, bat b, (2) 
i 
and 
H,=h yy (g,b,' a,ta,+g,* a, ayb,). (3) 


In H;, non-resonant terms are as usual neglected. We make the following 
ansatz for the form of the wave function 


O(t) = A(t) a,' Dp exp (—ivt)+ ¥ c,(t)b,' a,' Oy exp(—iw,t) (4) 
7 


This is reasonable, since the total excitation energy is conserved in eqn (4). 
We insert (4) into the Schrédinger equation and compare the coefficients of 
the wave functions which describe the atom in its upper state or the atom in 
its lower state, and an additional light quantum. We then readily obtain the 
following equations 

1 


A =—Yexgs* exp Liv—o,)] | 


(5) 
é, = 9, Aexp [(o,—1] | 
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We use the hypothesis of Wigner-Weisskopf which assumes that the ampli- 
tude of the atom decays exponentially 


A = exp(—yt) (6) 
and obtain from eqns (5) 


1 1—exp[—yt+i(@,—v)t] 
C= 794 y—i(@,—) (7) 


and in a selfconsistent manner 


, exp[yt—i(@,—v)t]—-1 

= — 8 
ja 5 Wil) (8) 
The right-hand side of eqn (8) can be evaluated for small values of y, using 
the formula 


lim 1—exp Liv—@)t P : 
lim Joexp ont = —— - ind (v-a) (9) 


and yields, in fact, a time independent quantity. So far we have merely 
reviewed the work of Wigner—Weisskopf. 

Let us now interpret the results. In the Wigner-Weisskopf hypothesis, 
eqn (6), it is assumed that the upper state of the atom decays exponentially, 
but from this, it is hard to see how any coherence properties of the light field 
arise. 

The conclusion usually drawn from the above solution is the following. 


= | |c,|2 doo dQ25 (v—@) (10) 
Aa 


represents the transition rate into photon states in a frequency interval Aw 
over an angle AQ, under conditions of energy conservation. Thus this quan- 
tity gives us a counting rate, but because it refers to photon numbers it says 
nothing about coherence properties. In order to find out what the coherence 
properties of the field are, we now apply the formalism of correlation func- 
tions. These have been extensively discussed in the chapter by Glauber in 
this volume. While Glauber discussed mainly the properties of correlation 
functions describing thermal (completely chaotic) light, we will now use 
this formalism for fields which are still coupled to their source. We first 
consider the following coherence function 


<®* (0) E,*? (x, t) B(0)), (11) 
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where we use the decomposition 


E,O (x,t) = exp (Ht) -= YVOrhe,) (6. 


x exp[—ik,.x]b,' }exp(—iH?). (12) 


H is the total Hamiltonian, (1), which acts on the creation operators b,' 
and transforms them from the Schrédinger representation into the Heisen- 
berg representation. E‘*? is defined by the Hermitian conjugate of (12). In 
order to evaluate eqn (11), we merely have to calculate expressions of the 
form 


<®* (0) b,,(t) (0) =<@* (0) exp (iHt) b,, exp (—iHt) ® (0)>. (13) 


This can be converted, by going into the Schrédinger representation, 
into 


(*(1)b, O(1)>. (14) 


When we insert the explicit wave function (4), we find that (13) vanishes, on 
account of the orthogonality of the unperturbed wave functions. At first 
sight one might conclude from this result that it implies that there is no 
coherence present at all. However, such a result could also be obtained in a 
completely classical theory if we averaged over all initial phases. Therefore 
we need the next order correlation function, in which the initial phase cancels 
out and which is defined by 


I (t+1, t) = (O* (0) EO (x, t+) E™ (x, 1) ®(0)>. (15) 


Again we indicate briefly how to evaluate eqn (15). We insert eqn (12) 
and its Hermitian conjugate into (15). From the point of view of investigat- 
ing the quantum mechanical aspects, we must evaluate expressions of the 
form 


<* (0) b,,' (¢+1)b,, OOO). (16) 
We again pass to the Schrédinger picture which leaves us with the expression 
<O* (t+) b, 1 exp (—itH) b,, (D>. (17) 


Using the explicit form of the wave function (4) we find, for the last operator 
b,, acting on O(f), 


Ca, (t) ay* Do exp (—iw,, t). (18) 
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Now the operator exp (—itH) yields just 1 when acting on (18), because there 
are no other transitions allowed. That part of (17) which stands on the left- 
hand side of the exponential operator can be evaluated in the same way as 
the right-hand side of (17), which led to (18). Using these results we find for 
(16) the following expression: 


c4,* (t+) c,,(t) exp Lim, (t+) —i,,t]. (19) 
In order to evaluate (15) using this expression, we see that it is necessary 


to perform a double sum over /,, 4,. This summation can be done using 
methods of ordinary classical physics, and we obtain finally 


4 : 2@ 
P(t+t7,)=2 _ exp| -2y(1-) | — |(8,2)7| exp (—ivt) exp (— yt) 


for ct—r>0 
=0 for ct-—r<0O (20) 


In (20) we have neglected y? compared with v. 6,, is the atomic dipole 
moment matrix element. © is the angle between the atomic dipole moment 
and the direction of observation. 

The amazing thing is that formula (20) could have been obtained equally 
well by using a completely classical treatment of the radiation field of a 
damped Hertzian oscillator. The result shows that the field E has an internal 
coherence time 1/y which appears here, multiplying t. Our result differs from 
one which is sometimes expressed, that coherence properties do not show 
up in the spontaneous emission of photons, because the photons have no 
phase. In fact, formula (10) does not contain a phase term. On the other hand, 
if measurements are performed which allow us to determine correlation 
functions of the form (15), we find exactly those coherence properties which 
we would expect from a classical treatment of a field with an internal coherence 
time 1/y. 

We should add two warnings or remarks, however. Our result should not 
be interpreted as meaning that quantum electrodynamics is unnecessary and 
can be replaced by a fully classical theory. Finding one such analogy, or even 
several, is, of course, not sufficient, and quantum field theory could be dis- 
pensed with only if all relevant results could be explained in a classical 
theory. We would also want this new “classical” theory to be at least as 
simple in formulation as the old quantum electrodynamical theory. Our 
main conclusion from our above treatment is that quantum electrodynamics 
because of its inherent dualism between particle and wave aspects, is suffi- 
ciently versatile to account for finding photons in spontaneous emission, 
and also to account for the coherence properties of the fields in this process. 
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The second remark is that one might be tempted to interpret the result as 
follows: if one believes in the field as a real quantity, the coherence properties 
will automatically follow, since coherence is a property of the field itself. 
On the other hand, the correlation function (15) could be measured by 
photon counting, and the measurement would imply the interaction of light 
and matter. From this, one would conclude that only matter occurs in 
reality. We prefer to leave the discussion of this point open however, because 
the choice of the field as the real quantity has at least simplified the interpre- 
tation of our results. 


Quantum Statistics of Optical Parametric 
Oscillation t 


R. GRAHAM 


The subject of this paper is a rather old one; parametric oscillation was 
first discovered by Lord Rayleigh" in 1883. Nowadays the term has a wider 
usage. Any oscillation of a system with time-varying parameters is called a 
parametric oscillation. Here, however, I want to confine myself to the 
simplest case. 

Let us consider two slightly damped harmonic oscillators (we call them 
“signal” and “idler” in the following) with frequencies a, and @,, and 
damping constants x, and x2. Let these two oscillators be coupled by some 
“parameter” and let us modulate this parameter with a harmonically varying 
force F, of frequency 3. A simple calculation then shows that, provided 
the matching condition 


|w3 — @, — @2| < Ky, Kz (1) 


is fulfilled, the effective damping in both oscillators is reduced, and above a 
certain threshold value of F, self-sustained oscillations take place. Below 
threshold this device is called a parametric amplifier, above threshold we 
have the oscillator region. If the two harmonic oscillators of the system 
coincide, i.e. degenerate into a single oscillator, we call it degenerate para- 
metric oscillation or subharmonic generation. 

Examples of parametric oscillators in everyday life are, of course, numer- 
ous. An ordinary swing serves as an excellent example. If you sit on a swing 
and excite it by raising and lowering your centre of gravity at twice the swing 
frequency, you are then part of a degenerate parametric oscillator. 

So far I have not mentioned a very important point, which is essential 
for parametric oscillation noise. You can sit on your swing and raise and 
lower your centre of gravity for a long time: nothing will happen if there is 
nobody nearby to give you the first push, or alternatively, if there is no 
other initial random excitation of the swing. In classical physics, one could 
imagine a situation in which there are no fluctuations. However, the situation 
changes completely if signal and idler are quantized. Vacuum fluctuations of 
these oscillators will always be present and these are sufficient for the build- 
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up of self-sustained oscillation. Thus, a quantized swing would easily be 
excited without the help of other initial excitations. 

Therefore, to study the part which noise plays in parametric oscillation, 
it is helpful to set up an oscillator in a region where quantum effects should 
be measurable. Thisgis at least partially achieved in the optical region. 
In the optical region, signal and idler oscillations are provided by two modes 
of the electromagnetic field. (The idler mode may be some other elementary 
boson excitation, e.g. an optical or an acoustical phonon, in which case we 
have stimulated Raman and Brillouin scattering respectively). The coupling 
parameter is the susceptibility of the medium in which the electromagnetic 
modes propagate. If we chose a medium whose susceptibility depends 
linearly on the electric field strength E, we can modulate this coupling para- 
meter by shining in a powerful beam of laser light of frequency w,. A simple 
experimental scheme is shown in Fig. 1. One of the first realizations of this 
scheme was achieved by Giordmaine and Miller. 


Wa Wi, Wo 
a SSS Xe aa ee - — — —-| Detector 
a 


Mirrors transparent at ws, 
reflecting at w, wo 


Fig. 1. Scheme of optical parametric oscillator (doubly resonant type). 


Generation of optical signal and idler modes in this device is possible only 
if there are resonant cavity modes which fulfill the phase matching condition 


|k,; —k, —k,| < x/L (2) 


for the wave vectors k, of the three modes involved. To achieve phase 
matching, it is necessary to compensate for dispersion in the crystal. This 
can be done by using a double refracting crystal and polarizing the pump 
in the extraordinary direction. It is then possible to choose ordinary signal 
and idler waves which satisfy both the energy and momentum matching 
conditions. 

Because of the central importance of noise in this problem, we wish to 
know the statistical properties of the signal and idler output of the para- 
metric oscillator. The experimental answer to this question has not yet been 
given, because experimentally the parametric oscillator is difficult to control. 
In most cases, the output contains several clusters of signal/idler pairs, 
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because the mode selection of this device is rather poor.“ However, the 
situation is improving, and there have recently appeared several papers on 
high efficiency parametric conversion.) Using these techniques together 
with a mode selecting device, it may be possible to perform the appropriate 
experiments. However in the following discussion I shall answer the question 
theoretically. 

The Hamiltonian of the system shown in Fig. 1 may be written 


3 
A = p> h QO, b; b, a2 Hl ys a5 FL erystetield + Fl ine (3) 


+ Agath + FA yath-tield = FA yath-cryst* 


We have introduced here the creation and annihilation operators, b' and b 
for the three modes in the cavity. The first term in the Hamiltonian describes 
the free motion of these modes. The free motion of the atoms of the crystal 
and their coupling to the field modes are described in the second and third 
terms. In the fourth term, the mode with frequency @ is directly coupled 
to an external pump force F, 


h : ; 7 
1: ne = Pa (F,* exp(i@3f) b, = ¥ exp(—i@3f) b,'). (4) 


There are losses in the system. The pump mode leaves the crystal without 
reflection, and the signal and idler modes are damped by mirror and diffrac- 
tion losses. These losses are described by coupling the field to heat baths. 
The fifth and sixth terms in the Hamiltonian are introduced to take care of 
the free motion of the heat baths and their coupling to the field modes. The 
crystal itself is also coupled to its surroundings, and this is described in the 
last term. 

In general the solution of this problem is very complicated, but Bloem- 
bergen has shown that the problem may be greatly reduced if we know which 
physical process is most important and dominant in the interaction of the 
field and the crystal. In this case, although the field modes are coupled via 
the crystal, the crystal operators themselves can be completely eliminated 
from the problem. We are then left with three field modes, with slightly 
shifted frequencies @,, interacting with each other via a nonlinear suscepti- 
bility, and slightly modified heat baths.‘ The effective Hamiltonian is 


Hog = >; how, b,tb, + ihB (b,' ba! bs — bs by ba) 
7 


h ea — e 
TF (F,* exp(ia@3t) b; — F, exp(—iw3t) b3') + Hétin + Wettn-rieta (5) 
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Detuning effects are neglected for simplicity. 8 is a real coupling constant, 
and is proportional to the nonlinear susceptibility of the crystal. 

Let me clarify the reasons why this effective Hamiltonian exists and under 
what circumstances. Very often in quantum field theory one has the task of 
eliminating some of the underlying field operators, in order to end up with 
an effective interaction between the remaining fields. However, it is not 
always possible to write down an effective Hamiltonian for this interaction, 
i.e. to achieve the elimination at least approximately by some unitary trans- 
formation. As a general rule, an effective Hamiltonian exists if only virtual 
excitations of the eliminated fields occur in the problem under considera- 
tion (compare for example, the elimination of the virtual meson excitations 
in nuclear force theory or of the virtual phonons in the theory of supercon- 
ductivity). If, on the other hand, real transitions of the eliminated fields are 
involved in the problem, damping phenomena have to be taken into account 
to avoid divergences, and then the effective interaction cannot be described 
by an effective Hermitian Hamiltonian, because of the irreversibility of the 
damping phenomena. A good example of the latter case is the laser. Here, 
real electronic transitions (see Fig. 2a) are the origin of the nonlinear satura- 
tion and mode coupling effects. In fact, the elimination of the atomic vari- 
ables‘®) from the laser problem leads to equations of motion for the mode 
amplitudes alone (e.g. the van der Pol equation), which cannot be derived 
from an effective Hamiltonian. 


Energy 


| 
(a) (b) 


Fic. 2 a. Real electronic transitions in a two-level laser atom, causing mode coupling 
effects. b. Virtual electronic transitions in a two-level atom in a medium with nonlinear 
susceptibility, causing the parametric interaction between pump, signal and idler. 


In the parametric oscillator case, we have mostly only virtual electronic 
transitions (Fig. 2b). This is why we can write down an effective Hamiltonian 
in this case (and in similar cases, for example, second harmonic generation or 
frequency conversion). If, however, one of the virtual transitions in Fig. 2b is 
tuned to resonance with the atomic transition, it becomes a real transition 
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and the effective Hamiltonian (5) is invalid. In this case only effective equa- 
tions of motion for signal, idler, and pump can be written down, since there 
is no Hermitian Hamiltonian.” In the following, we treat only the non- 
resonant case shown in Fig. 2b, where (5) gives a complete description. 
We may obtain some insight into the physics of the process by comparing 
the interaction part of the effective Hamiltonian 
Hitt = ihB(by' byt b3—b, bz b3') (6) 


int 


with the interaction part of the two-level laser Hamiltonian 
Hs = g )\ (ba,'+5' «,). (7) 
a 


Here «,' is the raising operator for the y-th two-level atom, «, the corre- 
sponding lowering operator. If we set up a correspondence between the 
signal and laser modes, i.e. 


bob; byt oot (8) 


then we also have the following operator correspondences 
bt bso > a,; b, bi oD a, (9) 
B B 


bot b describes the annihiliation of a photon in the pump mode with the 
simultaneous creation of a photon in the idler mode, thus corresponding to 
a transition from the higher energy level hw to the lower energy level ha). 
The operator )° «, describes the corresponding process for electrons in the 


two-level laser ‘atoms. 

In the laser we have many independent electrons, pumped into the upper 
state of many independent two-level atoms. A light mode is created by 
stimulated transitions into the ground state. In the parametric oscillator we 
have many photons, pumped into a mode with energy ha which create a 
light mode (the signal) by stimulated transitions into a mode with lower 
energy hw, (the idler). This process is called stimulated parametric emission. 
Noise in the laser system is caused by the atoms making spontaneous transi- 
tions, and similarly in the parametric oscillator system, noise is caused by 
spontaneous transitions of photons from the pump mode to the idler mode 
with the emission of a signal photon. This is called spontaneous parametric 
emission, and is induced by the vacuum fluctuations in the signal and idler. 

In the limiting case of an infinite number of atoms in the laser, the corre- 
spondence of the interaction Hamiltonians goes over into equivalence, since 
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then the operator equation 


oO 


yo a, = 65° b, (10) 


w=1 


satisfies all the properties of the operators on both sides of the equation. 
The other terms in the complete Hamiltonians are still different, so that the 
results differ in their finer details, but the overall results for the parametric 
oscillator show a strong analogy with the laser results, even in the case of a 
finite number of laser active atoms. 

Let us now start from the effective Hamiltonian and derive some more 
explicit results. First we derive a master equation for the density operator 
p of the three mode system, by eliminating the effective heat baths in the 
usual way. 


We obtain, in the‘ interaction picture 


0 
a {|e [," bot bs, p\+F, [b3', p] 


+ 2, (x, @i,+1) [b,9, 6,4] +-«, 7, [bt p, »,)| (11) 


+ [Hermitian conjugate] 


x, are the damping constants for the three modes, fi, is the sum of the 
number of thermal quanta at frequency w, and the number of quanta which 
are emitted spontaneously by the crystal atoms into this mode. At optical 
frequencies and realistic temperatures, fi, is a very small number compared 
to 1. 

The next step is to introduce a quasi-probability distribution function for 
the statistical operator. We could use the P-representation, the Wigner- 
function or the diagonal elements of p with respect to coherent states—all 
these quasi-distribution functions contain the same information as the 
statistical operator itself, and all multitime correlation functions may be easily 
calculated from them.“” In the parametric oscillator case we expect very 
strong correlations between signal and idler. Consequently the P-represen- 
tation does not always exist “?» ®) so instead we use a representation which 
does: 


O({a,}) = <y <a} 1 0 faa}), (12) 
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where 
b, | {a,}> = 4, | {a,}>. (13) 


Once we have calculated this function from the master equation, it can be 
used to calculate normally ordered multitime correlation functions.“!!) All 
we have to do is to make the following substitutions 


0 e 
b,t _ (2,4 Oa, ); b, md («+35) (14) 


and to use Q as if it were a classical probability distribution.“ For Q we 
obtain the Fokker—Planck type equation 


dQ a y 
Bo ES (K, 4, —B a2* a3) ae (Kz a — Ba,* a3) Q 
i) 
te Bye (k3%3+Pa,a,—F,)Q 
ee) 
2 3 raha @) 
a | a n,+1).——— > 
P Oa, Ou “3 Q +2 cana Ya, da ,* | 
+ [Compl. conj.]/. (15) 


This equation is still exact (within the approximations of the effective Hamil- 
tonian) and contains derivatives up to second order only. In the laser theory 
we obtain a generalized Fokker—Planck equation at this stage,‘13) which 
contains derivatives up to infinite order. This is because the number of laser 
atoms is in fact finite, whereas the interaction parts of the Hamiltonians are 
equivalent only for an infinite number of atoms. 

Let us now consider the second order derivatives of our Fokker—Planck 
equation which describes the fluctuations. The last term comes from the 
heat baths. The first term containing second order derivatives comes from 
the interaction part of the effective Hamiltonian, and shows that even with- 
out the heat baths, fluctuations are present due to the parametric interaction 
of quantized modes. This is just spontaneous parametric emission. 

The procedure which we have followed up to now is quite general and may 
be applied to most of the processes in nonlinear optics. It is especially well 
adapted to display the dual consequence of nonlinearity; the stimulated 
coherent effects show up in the first order derivative (drift) terms of eqn (15), 
and the corresponding spontaneous incoherent effects show up in its second 
order derivative (diffusion) terms. 
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We can solve the Fokker—Planck equation by making some approxima- 
tions. Firstly we assume that the amount of pumping is weak. The ampli- 
tudes of signal and idler will then be small and we can neglect the interaction 
of these modes with the pump mode. The pump mode amplitude is therefore 
fixed by the driving force and the losses, and is independent of the signal and 
idler fields generated. This is the parametric approximation. A theory of the 
parametric amplifier in this approximation has been given by Wagner and 
Hellwarth.“” If we also let the damping constants go to zero (this case was 
first treated by Louisell, Yariv and Siegmann “'* 1”), we obtain exponential 
gain for both signal and idler. However, this approximation will hold only 
for a short time at the beginning of the process. 

If we include damping and use the parametric approximation we obtain 
a combined linear Gaussian process for signal and idler. The solution for Q 
is then a 4-dimensional Gaussian, centred about zero. If | F, |? < #37 Ky K2/B? 
then the Gaussian has real width. This is just the condition of weak pump- 
ing. The right-hand side of this inequality gives the threshold value of | F, | 
for which the system becomes unstable and the parametric approximation 
will then break down. 

We are now able to calculate all interesting multitime correlation functions. 
According to the Wiener Chinchin theorem, the spectrum of the signal 
mode is given by the Fourier transform of the correlation function 
<b, (t + 1) b, (t) >. Below threshold the signal spectrum is a superposition 
of two Lorentzians, with the line centre at w,. One line is weak and broad 
(linewidth 4,), the other line is much more intense and narrow (linewidth 
4,). The half-widths at half-power are 


2 K,+K)* F,|? 
Ap = 5 ag [ane + (pax, a) (16) 
Zz K3 
(see Fig. 3). 
At threshold, 4, is zero and the output power 
Py = 2h@, K, (b,' (t)b, (t)> (17) 


diverges. Of course the linear theory breaks down long before threshold. 
We can eliminate the coupling constant B from eqn (16) and introduce 
instead P,, defined by (17). Not too far below threshold, 4, depends on 
| F, | mainly through the dependence of P, on | F,|. Inserting the threshold 
value of | F, | everywhere, but not in P,, we obtain 


hw, (Kk, K2)? 
A a5! 1 1 K2 
: Py, («,+K2)? 


(a, +n, +1). (18) 
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This expression is in complete analogy with the expression for the laser line- 
width. Corresponding expressions hold, of course, for the idler spectrum. 

Let us now consider two possible Hanbury-Brown-Twiss experiments. In 
the first experiment we absorb the idler frequency in a filter and measure the 
intensity correlation function of the signal mode, i.e. the quantity 


«bt (t) bf (t+) by (t+1) 5, ()> 


K,,() = (19) 
ii | <b,t (2) b(t) > I? 
signal linewidths 
Ay Ag 

BEG, WR ee a re ae eee eg 

Ko 

a 

threshold 


2 
2 | Fp|2——> 
1°2 °3 


Fig. 3. Linewidths A, 42, below threshold, as a function of the pumping force intensity 
(obtained in the linear theory) shown here for x2 > ,. 
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In the second experiment we measure the intensity of the signal mode in one 
detector and the intensity of the idler mode in a second detector, and cor- 
relate them. Thus we would measure the quantity 


< bt (t) byt (t+-1) by (t +7) b, (t)> 
<b," (t) by (t) > <2" (4) bp (t)> 
We may calculate K,, (t) and K,, (t) from the solution of the Fokker—Planck 


equation. However, as the general expressions are rather lengthy we shall 
discuss here only the following two limits. 


K,,(t) = (20) 


(a) Not too far below threshold, only the intense and narrow line of the 
spectrum will contribute and we obtain 


Ky, (1) = Ky2 (t) = 1+exp (— 24, |T)). (21) 


This shows that in this region of operation, we should observe a Hanbury- 
Brown-Twiss effect for signal and idler in their autocorrelation functions 
and in their crosscorrelation function. The width of the correlation function 
approaches zero when the pumping force F, approaches the threshold value. 
If we let the pumping force F, tend to zero, we obtain (if x, # K2) 


Ky1(t) = 1+exp(—2A, | t]) (22) 
B°\F, |? (“oe 


K3” (Ky —K»)? Ny N2 2 


x (exp(—2,|t|)+exp(—Az{t)))?. (23) 


2 
Kio (t) = 1+4 +k, ny = K2 ra) 


We still observe a Hanbury-Brown-Twiss effect in the autocorrelation func- 
tion, whereas it disappears in the crosscorrelation function, since signal and 
idler will be statistically independent (weak) beams in this limit. 


(b) We now consider the much more interesting case well above threshold, 
when depletion of the pump cannot be neglected. It is easy to show” that a 
steady state is reached, in which the pump mode is completely saturated 
and depleted to its threshold value (i.e. independent of F,) whereas the 
intensities of signal and idler are determined by F,. 


Ka [F,| K3VK, K> 
Se a (He), 24 
< 1 > Ky B p? ( ) 
Their relative intensity is determined by the Manley Rowe relation) 
be 
oe aan ge (25) 


Wy 20) 
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The statistical properties of the parametric oscillator are contained in small 
phase and amplitude fluctuations about this steady state. Thus, we quasi- 
linearize the problem with respect to these small fluctuations and obtain a 
solution of the Fokker—Planck equation for the resulting linear Gaussian 
process. The spectral linewidth of the amplitude correlation function of the 
signal is found to be half that given by expression (18). This linewidth is 
due to slow undamped stochastic motion of the phase difference between 
signal and idler, arising from spontaneous parametric emission. The sum 
of signal and idler phases is stable and locked to the phase of the pump. 

We are again interested in the theoretical result of the two Hanbury- 
Brown-Twiss experiments proposed above. As the general expressions for 
the correlation functions are again rather complicated, we give them only in 
the region not too far above threshold, i.e. in the limit 


K2 Ky 


(bit b> <« 4R2 


Since in general, x, «3/4? is a very large number, this limit covers all cases 
of interest. We obtain 


hao, \? 2,7? Ky? kz (Ny +24+1) 
K,,( =1+($+) exp (-AIt]), 26 
ho ho 2K 2k are (74, +724+1) 
Kit =1+( +) ( 2) ex —Alt 2 
20 re Oo eS p(—alrl) 27) 
with linewidth 2 
2 
ee B (28) 


AD, K3(Ky+K2) 


This result shows firstly, that signal and idler both have the same statistical 
properties as a laser mode. For both modes, the Hanbury-Brown-Twiss 
effect dies out if we go sufficiently far above threshold. The intensity cor- 
relation function then factorizes. Secondly, eqn (27) demonstrates that the 
Hanbury-Brown-Twiss effect also disappears for the crosscorrelation func- 
tion sufficiently far above threshold. In fact, besides a trivial normalization 
factor, K,>(t) equals K,,(t). This shows that photons in the signal and idler 
modes interfere with each other as if they were photons of one coherent 
mode. 

We can make a qualitative comparison between formulae (23) and (27). 
Both demonstrate the disappearance of the Hanbury-Brown-Twiss effect in 
the crosscorrelation function, but the reasons for the factorization of the 
crosscorrelation functions are exactly opposite in the two cases; in (23) the 
reason is complete incoherence, whilst in (27) it is complete coherence. 
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Finally, I want to mention that it is also possible to obtain a solution in 
the threshold region,“ by symmetrically applying an adiabatic approxima- 
tion to signal and idler, and eliminating the idler and pump amplitudes. 
This procedure leads directly to the Fokker—-Planck equation for the van der 
Pol oscillator in the rotating wave approximation, which one also encounters 
in the laser theory. Therefore solutions for the photon distribution of the 
signal mode, the transient distribution etc. may be written down immedi- 
ately. However, it has not been possible so far to derive an analytical theory 
for the threshold region, which also contains information about correlations 
between signal and idler. Some further work in this region seems to be 
necessary, especially when experiments on the photon statistics of parametric 
oscillators will be done some day in the future. 
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Radiation from a System of N Two Level Atoms 


D. F. WALLS 


There exists a close formal analogy between radiation from a system of N 
two level atoms“ ~* and the parametric amplification or down-conversion 
process of nonlinear optics.°~”) Parametric amplification in a nonlinear 
dielectric may be described by the following Hamiltonian, 


H =H oth, (1) 


where 
Hy = ho, a'at+ho, b'b+ho, cle 
H, = hx (a btct+at be) 
(a, b, c are the boson annihilation operators for the pump, signal and idler 
modes respectively. « is the coupling constant and w, = w, — w,). The 


interaction of N two level atoms with one (resonant) mode of the radiation 
field may be described by the Hamiltonian (1) with, 


Hy = ha cte+ho J, 
H, = hx (cJ 4. +c'J_) (3) 
(J,= i e,', Jy = Xo", J-= Yor, 
where o, are the z components of spin, and o*, o~ the spin flip operators 
for the individual atoms). The angular momentum operators J,, J,, and J_ 


may be represented in terms of two operators a, b obeying boson com- 
mutation relations.“ 


(2) 


J, =4 (ada — bt) 

J, = bat (4) 

J_=bta 
With this representation the Hamiltonians (2) and (3) become formally 
identical. We may make the identification; laser pump equivalent to upper 
atomic level, signal photon equivalent to lower atomic level, idler photon 


equivalent to emitted photon. One notes that the operators a and b differ 
from boson operators in that the eigenvalues of ata and btb assume integer 
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values from 0 to 2|J|. If the total angular momentum J of the N atom 
system is a maximum (i.e. J = N/2) the eigenvalues of ata and btb are the 
actual occupation numbers of the upper and lower levels. This is the case we 
shall consider. If J < N/2 the eigenvalues of ata and btb are the effective 
occupation numbers. Only in the case of a large ensemble of atoms, No, 
may the operators a and 5 be identified as true boson operators. 

The Heisenberg equations of motion resulting from the Hamiltonian (2) 
or (3) are nonlinear operator equations. Rather than attempting a direct 
solution of these equations, we shall search for constants of the motion and 
solve the Schrédinger equation for the system. Under conditions of perfect 
energy conservation, w, = @, + @,, it can be shown that the Hamiltonians 
H(t) and H,(t) commute with each other 


[Ho(¢), H1(4)] = CA), Ho(t)] = LA), H1(2)] = 0 (5) 


Thus a representation exists in which the total Hamiltonian H(r) is diagonal. 
Once the Hamiltonian has been diagonalized, the Schrédinger equation has 
been solved. 

The eigenstates of Ho are the number states of the form |n,, n,, n.>. If 
we assume the initial state is |”,, 2,, 0>, the possible eigenstates of Hy are 


Y = (|n,, m, OD, |ng—1, ny +1, 1D... 10, mg try, 1>)*. (6) 


We shall attempt to express the eigenstates of H, as linear combinations of 
the eigenstates of Hy. The result of H, operating on ¥ is 


H,¥=h« 4, ¥, (7) 


PROBABILITY DISTRIBUTION P(n,d) 


Y cae RES { L 
6 12 24 


n 


Fic. 1. Plots of p(x,A) against m for ng = 24. 
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where A,, is the (n,+1)x(n,+1) symmetric matrix 


0 a, 
a, 0, 
~ Ba 
A,, = rar (8) 


anda, = [(#,—r+1) (4, 4+7r)]*, r= 1,2... 


This matrix may be diagonalized numerically and the eigenstates and eigen- 
values of the system found. This enables one to calculate the mean number 
and the probability distribution of the emitted photons. This procedure is 
described in more detail in Walls and Barakat.‘” 

We shall consider first the results for the case of spontaneous emission 
M = J (n, =n, = 0). In Fig. 1 several examples p (, A), the probability that 
the eigenstate having eigenvalue J contains n emitted photons, are plotted for 
M = J (n, =n, = 0). In Fig. 1 several examples p (n, 4), probability that the 
eigenstate having eigenvalue 1 contains n emitted photons, are plotted for 
n, = 24. The eigenstates with A = +/,,,,, that is, the ground state and most 
highly excited state, have approximately a Poisson distribution in photon 
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Fic. 2. Mean number of emitted photons as a function of t, for various values of n as 
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number. The mean number of emitted photons is plotted as a function of 
t= xt in Fig. 2 for n, = 1, 2, 9, 23, 24. We observe the oscillatory (almost 
periodic) behaviour, in direct contrast to the exponentially increasing solu- 
tion predicted by the parametric approximation. 7) The fluctuations which 
appear are due to the amplification of the vacuum fluctuations which have 
a large effect for this low number of atoms. For a large number of atoms, 
the vacuum fluctuations, though initially the source of the field, are quickly 
masked and the mean number of photons follows an elliptic function. 

4 


PROBABILITY DISTRIBUTION P(n,T) 
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Fic. 3. Probability distribution of emitted photons at several different times, for 


Ng = 24 


MEAN NUMBER OF IDLER PHOTONS 
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Fic. 4. Mean number of emitted photons in stimulated emission, for 2, = 24 and 
increasing values of 71». 
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The probability distribution of the number of emitted photons is plotted 
in Fig. 3 at several fixed times for n, = 24. For very short times (t < 0-05) 
the probability distribution follows a chaotic or power law distribution. 
However, as the time increases the distribution becomes peaked about 
increasingly higher values of 7,, out to the first maximum of ,. 

In Fig. 4 the behaviour of the mean number of photons for the case of 
stimulated emission with n, = 24, n, = 1, 4, 24 is shown. We observe a con- 
siderable smoothing of the behaviour of 7, (t) as n, is increased. For n, = 4, 
the mean number of idler photons is already very nearly periodic. Thus a finite 
number of atoms in the ground state serves to mask the effect of the vacuum 
fluctuations. 


PROBABILITY DISTRIBUTION P(n,T) 


Fic. 5. Probability distributions of emitted photons for the super-radiant case, 
plotted at several different times. 


For nq = ny, (i.e. M = 0) the super-radiant state, 2,(z) follows a periodic 
classical type behaviour. The probability distributions of the emitted photons 
for the super-radiant case are shown in Fig. 5. For very short times (t <0-05) 
the probability distribution follows a Poisson distribution. This result is 
expected after observing that in this time range the source of the radiation 
field (i.e. the transverse component of the total angular momentum J) is 
essentially classical. 
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Non-Adiabatic Effects in the Laser’ 


F. HAAKE 


As has been pointed out by Haken, laser theory at the most fundamental 
level has to deal with a system having a tremendously large number of 
degrees of freedom; the active atoms interact with the electromagnetic field, 
and loss and pump mechanisms must be coupled to this “proper” laser 
system. There is no hope of obtaining an exact solution of the corresponding 
dynamical problem. It has been an important first step in all laser theories to 
make an approximation to eliminate the coordinates of the loss and pump 
mechanisms. The Markoff approximation has turned out to be completely 
sufficient for this purpose. The “proper” laser system is then described in 
such a way that it undergoes a quantum mechanical Markoff process. This 
can be done either by writing down the set of Langevin equations for atomic 
and field operators or, equivalently, the master equation for the atom-field 
density operator, or, in the framework of semiclassical theory, a Fokker- 
Planck equation for a distribution function of the atom and field variables. 

It should be noted that ali three types of equation resulting from this first 
step in laser theory still contain more information than is usually needed to 
explain the experimental results. These results are described in terms of the 
electromagnetic field alone and never in terms of the active atoms. It is there- 
fore tempting to eliminate the atomic variables too. This has been done 
many times using the “adiabatic” approximation, (this assumes that the 
atoms follow the motion of the field adiabatically). 

Let us consider two equations arising from such a procedure. Firstly, the 
well-known van der Pol equation 


b* + {k—aj+0, 5% b}b* = F*(t) 
with (1) 
CF (t)> = 0, CF* (t+1)F (t)> = 4¢6(2), 


and secondly the Fokker—Planck equation 


PBB.) = (6+ sp BY) (=a, + ty BB 


Q? 
+ 4g ae] PBB. 2) 
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In both equations we find terms referring to the damping of the field (x), the 
linear gain (a,), the nonlinearity («,,), and the fluctuations (q). We also see 
that the electromagnetic field here is assumed to have Markoffian properties. 
In eqn (1), this shows up in the 6-function correlation of the fluctuating 
driving force F*(t). On the other hand, this property is inherent in the 
Fokker-Planck equation (2). It is well known that the equations are equiva- 
lent. Equation (1) describes the quantum mechanical Markoff process in the 
Heisenberg picture and eqn (2) describes the same process in the Schrédinger 
picture. 

I will now put forward three objections to the adiabatic or Markoff 
approximation which underlies these two equations. 


1. As mentioned above, the “proper” laser system (active atoms + field) 
can certainly be treated as undergoing a Markoff process; but the field alone, 
as a subsystem, cannot. This is a familiar argument in the theory of stochastic 
processes. 


2. Intuitively, we may reason that the active atoms produce an indirect 
interaction between the photons of the lasing field mode. The situation is then 
similar to that in superconductivity, where indirect interaction between 
electrons is produced by phonons. The latter is well known to be a retarded 
interaction. The retardation effect is neglected in the BCS theory, but has 
to be retained when more sophisticated properties of super-conducting 
systems are studied. Correspondingly, an exact elimination of atomic vari- 
ables in the laser theory must lead to a retarded photon interaction. 


3. The problem of eliminating the atomic variables from the “proper” 
laser equation is somewhat analogous to the problem of eliminating heat 
bath coordinates when studying the interaction of a microscopic system 
(harmonic oscillator or atom) with a macroscopic heat bath. The essential 
condition for elimination of the heat baths in the Markoff approximation is 
that the heat bath should have a white or at least an extremely broad band 
energy spectrum. This is just the opposite of the condition sought after in a 
laser. 


These three objections do not help us to decide quantitatively if, and under 
what conditions, the adiabatic approximation is invalid. To do this, it is 
necessary to adopt a more sophisticated elimination method. Of course, it is 
possible to avoid any elimination at all and try to solve for example, the full 
set of laser Langevin equations. This procedure has been discussed in the 
chapter by Haken. However, here we use an elimination procedure which is 
based on Zwanzig’s projector formalism. 

Let a system be described by a density operator W(t) and consider the 
decomposition 
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W(t) = pW(t)+(1—p) W(t). (3) 


The algebraic quantity p occurring here may be defined as a projector 
acting upon W(t) such that: 


p> =p or (1—p)p=0 (4) 
Further, let W(t) obey an equation of motion of the form 
W(t)= -if W(t), (5) 


where £ is the Liouville operator, chosen according to the physical problem 
under consideration. Inserting the decomposition (3) into the right-hand side 
of (5) and then acting from the left, alternatively with p and (1—p), we 
obtain a system of coupled equations of motion for the two quantities 
pW(t) and (1—p) W(t) respectively. It is an easy matter to eliminate one 
quantity, say (1—p) W(t), from this set. Then the following inhomogeneous 
integrodifferential equation for the other quantity, pW (rt), is obtained. 


pW(t) = —ipLpW((t) + [asK(pwe-+500. (6) 


The inhomogeneity, .4(t), and the kernel, K (s), are given by 
I(t) = — ipL exp[—i(1—p) Lt] x (1-p) WO) 


K (s) = — p# exp [-i(1—p) #s] x (1—p) %. ” 


This formalism was first used by Zwanzig himself. He defined the projector, 
p, such that pW(t) is the diagonal part of the density operator W(t) with 
respect to some specific representation. However, for the present purpose 
we follow the method of Argyres and Kelley“ in which p is defined as the 
trace over atomic variables of the atom-field density operator for the laser: 


p=Atr, with tr,A=1 


8 

pW(t) = Atr, W(t) = Ap(t). © 
The “parameter” A is an operator in the Hilbert space of the active atoms. 
It can be chosen arbitrarily within the constraint of normalization which 
ensures that p is a projector. With this definition of p, eqn (6) can be regarded 
as an exact equation of motion for the reduced density operator p(t) of the 
laser field. Exact elimination of the atomic variables has thus been achieved. 
Of course, the Liouvillian Y has to be specified according to the laser prob- 
lem. Taking eqn (5) as the well-known laser master equation, Y becomes 
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F= LytLpt+LyaptingtiAg. (9) 


Here 24, £p, £4, are the commutator operations with the Hamiltonians, 
and refer to the free motion of the active atoms, the field, and the atom-field 
interaction, respectively. A, and A, represent the incoherent part of the 
motion of atoms and field according to pump and loss mechanisms (see 
Haken™ for an explicit description of these terms). 

The equation of motion for the field density operator, eqn (8), cannot be 
solved exactly. We therefore expand the kernel and the inhomogeneity as a 
power series, in terms of the interaction Liouvillian £,,, and truncate these 
series at some finite order. The expansion must be continued up to terms in 
fourth order of the atom-field interaction (¥*,,), since this is the term 
which contains the nonlinearity necessary to describe laser action. If the 
expansion were continued only up to terms in L?4,, this would be physically 
equivalent to considering only the action of the active atoms on the field 
and neglecting any reaction in the reverse direction. Since saturation effects 
in the atomic system are most important in the laser, this would be unreal- 
istic. 

When expansion is carried out to the correct order, it is found that the 
time behaviour of the kernel is characterised by four different retardation 
effects; . 


1. relaxation of the atomic dipole moment produces a retardation of 
the form exp(—y,5); 

2. relaxation of the atomic inversion contributes a retardation term 
exp (—7)5); 

3. the field damping occurs as exp (—xs); 

4. the inhomogeneous broadening of the atomic line appears as 


N N 
as 9,” exp [~i (v,-@)s], 2 9, exp [—-i(v,—@)s]. 
R= L= 


This is the most complex situation to be dealt with in laser theory. In order 
to show the essential features of the present treatment, we consider the special 
case of a homogeneous line in resonance (v, = w) and assume that the field 
damping is small compared to the atomic damping constants (x < Yio Pye 
for example, as in a gas laser). The resulting equation of motion for the 
field density operator, expressed in terms of the P-representation 


p(t) = | 4? BP(B, B*,1) 16> <BI 
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with respect to coherent states b|B> = B|B>, then becomes 


PBB =e B+ B+ Inseam Saar | PB PY 


op ape 


‘Opt aan | 
Se Ne aa ee a ee 


K (s) xP (B, B*,t—s) (10) 


+f a[(Z : p*) (- — 01; (8) + oq: #2 (8) B*B)+49¢(s) 


There is now a remarkable similarity with the Fokker—Planck equation (2). 
We again find terms describing the field damping, the linear gain, the non- 
linearity and the fluctuations. However, we see here that the last three effects 
occur under a time integral and that they appear retarded as required. The 
retardation functions ¢,(s), 62(s) are given by 


$1(5) = yxexp(—y,5) 


#6) = GAUL ean(- 78) — {1 +5. -yp} exp(—y)]- (1) 


In eqn (10), the inhomogeneity has been omitted. ¥(t) turns out to decay 
as exp (—y,f) and therefore does not play any role in the laser operating under 
stationary conditions. It is however, important for a discussion of transient 
effects and we see by inspection of eqn (7) that it depends on the initial 
conditions in the laser. Some correction terms to the kernel K (s), in particular 
those which contain field variable derivatives up to fourth order, have also 
been neglected. We can see that this is valid if we normalize the field variables, 
viz; 


The correction terms then contain 


J %nt_ = [photon number at threshold]~! <1. 
q 


We now turn to the solution of the integrodifferential equation (10). The 
kernel K (s) in this equation obeys the following differential equation, as a 
function of time 
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K(s)+Qy +7) KG) + v2 1 +29) K() +7127) K() = 0. (12) 


Differentiating eqn (10) with respect to ¢ and linearly combining the result- 
ing equations for P(t), P(t), P(t) and P(t) according to eqn (12), we find 
that the P-function obeys a fourth order differential equation in time. A 
simplified version of this equation becomes 


P(t) + (29, +74) PO +71 1 +29) PO +942 9) PO= 712 9, APCD, (13) 


where A is an abbreviation for the Fokker—Planck differential operator 
occurring on the right-hand side of eqn (2). 

Now we may discuss the non-adiabatic or non-Markoffian effects of the 
laser field mode. The eigenvalues and eigenfunctions of A are known from 
Risken’s work 


A fam (B, B*) = Yum Sm (B, B*); n=0, +1, Pepa: m=0,1,2,... (14) 


We are left with the problem of finding the time dependence of the general 
solution 


PB) = SSS Apu Tam (Sam Bs B* 


The secular equation for T,,,(t) has four roots which depend on the relative 
sizes Of 1, Yy> Yam. Near threshold, Yum < 1,7; for all experimentally 
interesting Ym. One root then practically coincides with y,,,, the others being 
very much smaller. In this case the second and higher order time derivatives 
in (10) are completely negligible and (10) reduces to the Fokker—Planck 
equation (2). On the other hand, the four roots become comparable in magni- 
tude as ,,, approaches ,, 71. FOr Yym Close to y,, y, the roots become com- 
plex. Results of Risken and Vollmer show that the latter condition can indeed 
be fulfilled sufficiently far above threshold for some ,,. AS a result, some 
observables (for example, the intensity correlation function) will no longer 
approach their equilibrium values monotonically but will show damped os- 
cillations. An experimental investigation of these effects should be very 
interesting. 
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Determination of the Statistical Properties of Light 
from Photoelectric Measurements 


J. PERINA 


1. INTRODUCTION 


It is well known that photoelectric measurements yield information about 
the statistical properties of fluctuating light beams. Arecchi, Pike and Mandel 
and their co-workers, together with many others, have done a lot of work in 
this field.*~**) Such measurements give us the distribution of photo- 
electrons emitted within a fixed time interval T, from a photodetector upon 
which the light beam is incident normally. This distribution is identical, as 
we shall show later to the distribution of absorbed photons and it can there- 
fore be called a photon-counting distribution. Alternatively, the measure- 
ments provide the moments of this photon-counting distribution, from 
which the factorial moments can easily be derived using the following 
relation, (2). . 

If <W*), is the moment of the probability distribution P -(W) of the time 
integrated intensity W, given by 


T 
w= [rear (1) 


then ' es 
WS), = (apr) = 2, P(n)n(n—1)... (n—k+1) 


=a in) +... ta <n, (2) 


where p(#) is the photon-counting distribution, <n*> are its moments, 
<n\/(n—k)!) are its factorial moments and 


a = (-)* > (4X2 eee X,-)), J = 1, 2, eee k—1, a =1, (3) 


The sum is taken over all choices of the numbers Xy, X2,..., X,x-~, from the 
sequence 1, 2,...,k—1 regardless of the order. 
However, in order to understand the statistical behaviour of light, 
especially laser light, we want to obtain information about the distribution 
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P,/(W) of the integrated intensity from the photon-counting distribution 
P(n) itself or its factorial moments. If the time interval T of observation is 
much smaller than the coherence time, Py(W) will give us the distribution 
P.,(1) of the intensity J. The main purpose of this paper is to solve this prob- 
lem, which from a mathematical point of view, means inverting the photo- 
detection equation 


wy" 


n! 


pn i, PW) exp (—aW) dW. (4) 


‘Here, the photon-counting distribution p(n) is expressed in terms of the 
integrated intensity distribution P,(W), and « is the photoefficiency of the 
detector. This equation was derived by Mandel“* on the basis of a semi- 
classical theory, and by Glauber“ on the basis of a quantum theory. 

We shall show first of all that the distribution of photons absorbed by a 
photodetector is of the same form as the distribution of emitted photo- 
electrons. This is a consequence of the definition of the number operator in 
the theory of the quantized electromagnetic field. We shall also show how 
the integrated intensity distribution can be determined from the photon- 
counting distribution or its factorial moments, using a generating function 
and orthogonal (Laguerre) polynomials. We will then give conditions which 
must be placed on the factorial moments of the photon-counting distribution 
(or on the photon-counting distribution itself), so that it is possible to recog- 
nize the distribution P,(W) as either a non-negative ordinary function or a 
generalized function whose support is composed of a finite number of 
points. 

Using a formalism of arbitrary ordering of field operators, recently intro- 
duced into quantum optics by Agarwal and Wolf,“ and, in particular, the 
parametrization of this ordering, introduced by Cahill and Glauber,“?” we 
can derive the general photodetection equation, which expresses p(n) in 
terms of the distribution P(W, s), related to s-ordering of the field operators 
(Perina and Horak“ ®), 

The methods described will be demonstrated by giving examples of one- 
mode and also multi-mode superposition of coherent and thermal fields 
(Perina and Mista,“® Perina and Horak®). In particular, we derive the 
quantities <W*>,, p(n), and P,(W), taking arbitrary mean occupation 
numbers per mode. As a special case we consider these quantities when the 
mean frequencies of the thermal and coherent fields are different. This 
situation arises in heterodyne detection of thermal light. Results obtained 
for s = —1 (antinormal ordering) could be useful for detection of the field 
by means of so-called “quantum counters,” which operate by stimulated 
emission rather than by absorption. 
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2. THE PHOTODETECTION EQUATION 


In order to show that the photodetection equation arises out of the proper- 
ties of the quantized electromagnetic field, we define the number operator 


fA = ys A;* (x) 4, (x) d3x. (5) 
J=1JSL3 


The detection operator, A(x) (closely connected with the operators of the 
electromagnetic field strength) is given by‘21:22) 


A(x) = L 3? x es (k) a, exp [i(k.x—ckt)]. (6) 
k s=1 


Here, 4, is the annihilation operator of a photon in the wave-vector spin 
mode (k, s) = 4; ek) is the unit polarization vector. We consider here the 
number operator defined over a normalization volume I3. We could equally 
well define this operator over an arbitrary volume V. All the results given 
below would still be valid (Mandel, ?*) Perina2”), 

Using (6) we can rewrite (5) in the form; 


A= ra,! a. (7) 


We can also introduce the expectation value of A in the coherent state \{o,}> 


W= | & ;* (x) A(x) d?x = ¥ [a,|, (8) 
J JL a 
where 2 
A(x) = LIPS > e; (k) a, exp [i(k.x — ckt)] (9) 
k s=1 


and the complex number «,, is given by 


Ags| {2s }> = Os] {a%.}). (10) 


We express the density matrix in terms of the Fock and coherent states(25°21:2) 


p= 2 p(t}, {7,}) | {1,}><{m,} | = ce | {xa} >< {og} [d?{o,}. (11) 


p({n,}, {m,}) are the Fock matrix elements of the density matrix and ¢ ,({«,}) 
is a weighting function (this is assumed to exist at least as a generalized 
function for the fields under consideration). Using the commutation rules and 
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p({n,}) = p({n,}, {n,}) we can then write for the moment <A") 
Cay = ¥ p(n) tna} > 
= [ atteadctoadlae{Z]4i"@) yo dx} Has}ra{ay}, (12) 


where W, indicates the operation of normal ordering. Substituting the unit 
operator )" |{n,}><{n,}| = { into (12) and using the identity 
{na} 


2n, 
Kadi? = 1 ae exp (— [a,l) (13) 
we obtain 
ee) 2na 
¥ pont = ¥ { f bares) TT exp (—lasia(ax}}n', 4) 
n=0 {na} a Mj: 
where 
Pn) =X pttna}) (15) 
eae 


and Al{n,}> = n|{n,}>. 
From (14) it follows that 


lo [2m 
pon) = ¥_ | bac{Cas)) 2 exp (InsP) 4 fa}. 18 
rim a Aye 
Using the multinomial theorem 
A ae Pek 
x" _ (2 a) (17) 
Ing=n A n,! n! 
we obtain?*7:) 
w” . 
p(n) = Pa Aaah) exp (— W) d?{or,}. (18) 


Then substituting 


P,(W’) = | #({a3})9( W"— y i?) d?{a,}, (19) 
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we finally arrive at 


-- exp (— W) dW. (20) 


iv 9) 
p(n) -{, Py(W) 
This is the photodetection equation. 

We have thus obtained the same relation for the distribution of the number 
of photons in the volume L’, as for the distribution of photoelectrons emitted 
within the time interval T. Thus, although it is the statistical properties of 
photoelectrons which are studied in photoelectric measurements, these directly 
reflect the statistical properties of the photons themselves. 

The situation would be completely different if the field were detected by a 
so-called “quantum counter”,(?*-?*) operating by stimulated emission rather 
than by absorption. These detectors measure antinormally ordered products 
of field operators, so that the commutation rules for re-ordering of normally 
ordered products to antinormally ordered products give the contribution of 
the physical vacuum through the commutators. (In this case, since 


5 at Aex=3 i 4A, d?x—M, 
ij ij 


through the number of modes M.) The distribution of counts p’(n) is given by 
the same relation as (4) with P_,(W) > P,(W) in antinormal ordering?» 


P(r) = | P.lW) exp (—W) AW, (21) 


but in this case, the distribution of photons is given by?” 


d \ntM-1 
pin) = (5) [PW exe Wao (22) 
so that oOo pyatM-1 
PAW) = exp (— a ORE 5 
a(W) = exp ( Wy GaM— pie” (23) 
The relation between p’ and p is obtained by substituting (23) into (21)? 
’ — ae n+a+M—t1 
Pier) = B pogarnnm (EEE MT) (24) 
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3. DETERMINATION OF THE INTEGRATED INTENSITY DISTRIBUTION 
FROM THE PHOTON—COUNTING DISTRIBUTION 


To determine the statistical properties of ight from photon-counting statistics, 
it is necessary to invert the photodetection equation (4) (we shall consider 
a = 1 for simplicity). Denoting n!p(n) by qg(n) and P,(W) exp (—W) by 
Q(W), we see that this problem can be reduced to the moment problem, i.e. 
given the moment sequence 


[ocr Wedw=<Ws, k=0,1,... (25) 
0 


we wish to determine the distribution Q(W). Thus, determination of P_,(W) 
from its moments (i.e. from the factorial moments of p(n)) is the same kind of 
problem as the determination of P,(W) from p(n) itself. We shall confine 
ourselves mainly to the determination of P y(W) from p(n). 

We can see that a formal solution of (4) may be written in the form?® 


Py(W) = exp (W) » (— 1)" p@™d(W), (26) 
where 6 (W) is the nth derivative of the Dirac function. 


3.1. METHOD USING AN ANALYTICAL GENERATING FUNCTION?” 


In general, a mathematical expression such as (26), with an infinite number of 
terms, may be meaningless. Such a series cannot usually be considered as a 
generalized function, but it can be considered as a so-called ultradistribution 
(we will return to this point later). However, here the series does have meaning, 
because the characteristic generating function is an analytical function, 


exp (XW) p= } ; P,(W) exp (ixW) dW. (27) 


This is because P,-(W) = Ofor W < 0. Expressing 6 (W) in (26) as a Fourier 
integral, 


5° (W) = > iz . (ix)" exp (ixW) dx, (28) 


we obtain 1 é 
P,(W) = 57, xP cw) G(x) exp (ixW) dx, (29) 


where G(x) = SS (— ix)" p(n). (30) 
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This is the generating function of the distribution P,-(W) exp (—W). This 
method was proposed by Wolf and Mehta. The analytical form of the 
generating function G(x) makes it possible to obtain the values of G(x) for all 
x by means of an analytical continuation if the series (30) has a finite radius of 
convergence. Thus, P ,(W) is unique. 

For example, for the Bose-Einstein distribution we have 


p(n) = Cay" + <n) **, 


and we get 
G(x) = (1+ <n) +ix<n>)~* and P(W) = <n>~* exp (— W/<n)). 
For the Poisson distribution, we have 


pia) = <n)" exp (—<n))/n! 


and we get 
G(x) = exp[—<n) (1+ix)] and P,(W) = 6(W —<n)). 


If the factorial moments of p(n) are given, the moment problem (25) can be 
solved in an exactly similar way. 


3.2. METHOD OF ORTHOGONAL POLYNOMIALS3°~34) 


Since every term of (26) represents a generalized function with one-point 
support, we must know all the p(n) to be able to reconstruct P,(W) as an 
ordinary function. This cannot be done in practice. Therefore, we give here a 
method based on the decomposition of P,(W) in terms of Laguerre poly- 
nomials, in which every term has the whole complex plane as a support. Such 
a prescription can be used to determine P,(W) approximately, if a finite 
number of p(n) (or its factorial moments) are known from experiment. 

Firstly, we consider the one-mode case with phase information, so that we 
invert the formula@!:25) 


p(n. m) = | A exp (—lal?) 4.10) a, G1) 


where p(n, m) are the Fock matrix elements of the density matrix and ¢ ,(a) is 
the weighting function of the “diagonal” representation of the density matrix 
in terms of coherent states. This problem was solved by Sudarshan‘? in the 
form of a double series of the derivatives of the 5-functions. From the 
mathematical point of view, such a series usually cannot be considered as a 
generalized function. For this reason, Sudarshan’s original prescription for 
constructing @ 4-(«) was criticized by a number of authors. Furthermore, it 
was shown by Cahill* that such a series does not lead to a distribution on 
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the D,-space of testing functions if an infinite number of photons are present 
in the field. However, it was also shown by Miller and Mishkin®® that an 
infinite series of successively higher derivatives of the 5-functions can give rise 
to a distribution on the Z,-space of testing functions, which is the Fourier 
transform of D,. Mehta and Sudarshan,°” and Klauder, McKenna and 
Currie®® (see also Klauder and Sudarshan?) have shown that all density 
matrices can be expressed as limits of sequences, whose members each have a 
diagonal representation. These sequences can be composed of a finite number 
of derivatives of the 6-functions or of square integrable functions. Deeper 
insight into this problem of the existence of the ¢,-function was given by 
Cahill and Glauber.“”) They have shown, on the basis of a kind of arbitrary 
ordering of field operators (s-ordering), the way in which the singularities of 
the ¢ ,-function appear. Cahill@* has proposed some rules for regularization 
of the ¢ ,-function. 

We note here that we can also obtain some results concerning this problem 
from our decomposition of ¢, in terms of Laguerre polynomials. Let us 
consider the function ¢ ,-(«) in the form®* 


b.e(@) = exp—E~ Dlal*Y | ¥en" L Ele?) 


+ enter LAC}, 2) 


where € > | is a real number and L,* are the Laguerre polynomials. Multiply- 
ing (32) by a** L,* (Ela|) exp (—|a|?), integrating over é« and using the 
orthogonality condition for the Laguerre polynomials, viz. 


a —. WG+44+ 07 
a x* exp (— x) L(x) Ly'(x) dx = "FG 4h (33) 

we obtain the coefficients c;, in terms of the Fock elements p(n, m) 

cn = apap | bared exp (— lal?) a4 LAC?) o(e) 
s! 
=m ON (2) (Gem) eres — Go 
where we have used (31) and 

Law) =17 + arr $Y ——— (35) 


=o S17 — s)i(s + AE 
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Equation (34) can be inverted into the form 


piss +2) = say (SEP L- ws )G+ de, (36) 


We see from (32) that 


oo { 
{ #2) dla) = mY ¥ te pieeae 


tay Y len ; (37) 


r [G +41? 
I! 


where we have put € = 1 and du(a) = exp (— |a|*) d?a. 
Hence, if (37) is finite, then ¢ ,-(«) is an L,-function in the space dy, and the 
series (32) converges to ¢ ,-(«) in the norm of L,, i.e. 


lim | (by — $4)? dp = 0, 
N70 


where ¢? is a partial sum of (32). In this way, we can determine from (34) 
a class of physical fields, specified by p(n, m), for which ¢ ,-(«) is a member of 
the L,-space.°* Also, using (32) we can construct sequences, whose members 
lie in L,, which converge to the diagonal weighting function @ we 

For example, for the Gaussian field specified by p(n, m) =6,,,.<(nr>"/ 
(1 + <n;>)'*", for which ¢,(«) is an ordinary function (nn r>) 1 exp 
(—lel?/<nz>)), we have cjp = (1/j!) (1 + <ny>)747*, and (37) gives 


>: [dl + <np>)?1 I> * = (nz? + Kn), 


which is indeed finite. 
We give two examples of states, for which (32) leads to a generalized 
function. 


a. The coherent state |f>: 


# (a) = 8a — B) = lim —exp (— IBt?) 


M 
«> ee y Sopa (oat Lit (\BI) + y (e)| (38) 


j=0 \ A=0 
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and if it is averaged over the phase of a: 


1 
b(lal) = d(\el? — |p|?) = eee APCs [B|*) >» oi L;° (|a|7) L,° (1B|?). 


(39) 
b. The Fock state |n): 
(lal) = (— 1)'5((al?) exp (— fal?) = lim — yo i? (la). 
(40) 


Since (4) does not depend explicitly on the number of modes M, putting 
4 = Oin (32) we obtain 


PW) = exp[— (E- 1) WI Y Ly (EW), (41) 


where L, = L,°, and from (34) 
= ep= 5d (- (1) ee. (42) 


If only the first N terms of p(n) are known, the accuracy of this approximation 
is given by (we put € = 1) 


[ecar) PCM) exp (— Waw= SF (allel? <e, 4) 
0 n=Nt+1 
where P_,‘") denotes the function constructed as an Nth partial sum in (41), 
and ¢ is arbitrarily small. 

Sometimes it is more suitable to use another decomposition of P WW) 
which has an explicit dependence on M: 


P,(W) = Ww™-t » ¢;Lj;“"'(W), (44) 
ae oe (= 1° 
where o> GiMen! oy Gas!+M ait PO): (45) 


Some examples demonstrating these methods will be given later. 
If we want to determine the distribution Q(W) from its moments <W*) = 


few) W* dW = q(k) it is sufficient to substitute P ,(W) exp (—W) = Q(W) 
10) 
and p(s) s! = q(s) into (44) and (45). 
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4. THE FORM OF THE INTEGRATED INTENSITY DISTRIBUTION 


We shall now use some theorems about moments from statistics“°*” to show 
clearly the conditions under which the integrated intensity distribution, given 
by (41), represents either an ordinary non-negative function or a generalized 
function whose support is composed of a finite number of points. 

We define the moment sequence 


M,, -{ W" dF(W), n= Oly sss (46) 
0 


where dF(W) = F'(W) dW = P,,(W) dW. In the same way we can consider 
the sequence {p(n)}, denoting p(n)n! by M, and exp (— W) P,(W) by F'(W). 
We now introduce the following system of quadratic forms“ 


= "5 


i,k=0 


cof m 2 
Mj 44 Uj Uy ={'(x, mu) dF(W), m= 0, ee (47) 


where {u;} is an arbitrary non-trivial real vector. Thus the necessary and 
sufficient condition for dF(W) to be non-negative is Q,, > 0, for all m and an 
arbitrary non-trivial vector {u;}. The condition Q,, > 0 is also necessary and 
sufficient for the existence of a non-decreasing function F(W) (0 < W< ©) 
which fulfills (46) and has an infinite number of points at which the function 
increases.” Hence, if Q,, > 0 for all m and an arbitrary non-trivial vector 
{u;}, then it is possible to construct the distribution F/(W) = P,(W) > 0 
from the given moment sequence {M,}° in a unique way. Another case may 
arise if, for some m and some non-trivial vector {u;}, the quadratic forms (47) 
equal zero. It can be shown”) that in this case, a unique distribution P_,(W) 
exists, whose support is composed of a finite number of points. This is the 
minimum number of m for which Q,, = 0. On the other hand, if the function 
Py(W) = F'(W) is concentrated at a finite number of points (say M) then 
Q, = Oform = M,M + 1,...and fora non-trivial vector {u;}. 

These results can be demonstrated by the following examples. Let us 
consider the system of determinants 


M,M, ... M, 


a a ins Me er (48) 


Mn Mn+1 se Mom 


If ,, > 0, then Q,, > 0. These conditions can easily be shown to be valid for 
thermal light for which M, =n! <n,>", so that the function P,(W) must 
exist as a unique ordinary function. (In this case Py(W) = <n->~! exp 
(— W/<nz>).) For the coherent field M, = <n,>" and we see at once that 
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9, = 0 form 2 1 and By = 1. Thus, Qy > 0 and Q,, = 0 for m > 1. Using 
the above theorem, we find that the corresponding function P,(W) is a 
generalized function having a one-point support. In fact, this distribution is 
equal tod(W — <n,)). 

Hence, if the moments M,, are known from experiment we should be able to 
decide, using the determinants (48) whether the distribution P ,(W) exists as 
an ordinary non-negative function or as a generalized function, whose support 
is composed of a finite number of points. 


5. s-ORDERING OF FIELD OPERATORS IN QUANTUM OPTICS 


Before demonstrating the methods developed above, we shall mention the 
problem of arbitrary ordering of field operators in quantum optics. This will 
enable us to give a number of examples in a very general form. 

The problem of ordering field operators in quantum optics plays an in- 
creasing role in connection with generalized phase space distributions of 
optical fields‘?!»?7-25-2°42), and also in laser theory.“*4*) Recently the prob- 
lem of ordering field operators, and the correspondence between functions of 
c-numbers and functions of g-numbers has been systematically treated by 
Agarwal and Wolf." (See also Lax.4>) Their method was used for treating 
quantum dynamics problems in phase space, and for obtaining an expression 
for the time ordered product of Heisenberg operators, in terms of products 
ordered according to a prescribed rule (Wick’s theorem can be deduced as a 
special case). A general treatment of this problem was also given by Cahill and 
Glauber.“”) They calculated quantum expectation values based on a kind of 
parametrization, which makes it possible to interpolate between normal and 
antinormal orderings, paying particular attention to the normal, symmetric 
(Weyl) and anti-normal orderings, and their differences and inter-relation- 
ships. These methods show when and how singularities appear in the cor- 
responding weighting function, particularly in the weighting function of the 
diagonal representation of the density matrix in terms of the coherent states. 
An extension of this formulation to M-mode fields is also possible and fruitful 
in some cases, particularly for the superposition of coherent and thermal 
fields.1 8? 

This formulism is based on the decomposition of operators in terms of the 
so-called T-operators which are defined by 


T(B,s) = n~ | exp [islal? +a(a" — B*)—a*(a—f)} d’%  —1<s<l, (49) 


for which we have 


Tr{T (a, s) T(B,— s)} = nd(a — f). (50) 
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The density operator p can be decomposed in terms of Tin the form?!” 


p= x! | bB,— 5) TBs) 0B, (51) 
where 
$(B,— s) = Tr{p T(B,— 5)}. (52) 


From (49) one can see that the T-operator represents an s-ordered 6- 
function, and is equal to the 6-function for s = 0. The cases of normal, 
symmetric and antinormal orderings can be obtained for s = 1,0 and — 1. 
For example, for s = —1, 7(6,—1) = [B><B| and (51) gives the diagonal 
representation in terms of coherent states. 

Using such a formalism we can derive the following relation between s,- 
and s,-ordered generating functions“) 


. _ Sy —- 8 , ~M iy 
<exp GM>= (1+ 3 v) (ox [al a 


The corresponding moments are 


iy=0 


-gemam(2at) (Gs), 6 


and the corresponding integrated intensity distributions are given by 


rons Ef (a) on[ MED 


Sy—S5,J0 \W So —- Sy 


k dé . 
CA 5 = day <exp (iyh)>,, 


ww’)? 
Ino (44 ) 
52 


. P(W,’ s2) dW’ fors, > S;, (55) 
— 31 


where I~ 1(x) is a modified Bessel function. It is also possible to derive the 
generalized photodetection equation“ ® 


= GemawilTes) (Far) [mont 


4w 2W 
—_ - dW. (56 
« (=r) | | (66) 
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Substituting s = 1, and using the asymptotic formula 


(n+ M-—1)! 
n! 


(— x)’, (57) 


L,""1(x) ~ 


we obtain, from (56) the standard photodetection equation 


exp (—W) dW, (58) 


w" 
n} 


p(n) = { : P,(W) 


where Py (W) = P(W, 1). 


6. SUPERPOSITION OF COHERENT AND THERMAL FIELDS 


We shall now give some physically interesting examples to demonstrate the 
methods developed above. 


6.1. ONE-MODE CASE 


First we shall consider the superposition of coherent and thermal one-mode 
fields. 
The Fock elements*® for this case are 


Mon (earl sre 


eo eee | 
«hn CEE SNE ” 


where <> is the mean occupation number in the thermal field and f is the 
complex amplitude of the coherent field. We substitute these into (34) and 
obtain 


ee eee -s|Se 
BR + Cappy OP LT > | TG FOP 
ELBI? 


x [1 + <np>(1 — OV B* Lf ( ) (60) 


(1 + <np>)[1 + Kap — 8] 
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Then substituting this result into (32) and using the identities® 


co —a/2 
dias a hat (x) L,7 (y) 2" = oe 
4 
x exp |- z= 2! 4 I, (2 oo), (61) 
I_ (x) = 1,(%) 
and +00 
ps I(x) exp (ind) = exp (x cos @), (62) 
we obtain 2 
7 _ la — Bl 
$.4(@) = (adap) exp | - SEE], (63) 


This is the diagonal representation function for the superposition of coherent 
and thermal fields. 
By using (61) and (62) we can easily verify that for the coefficients (60), 


: 3 
¥ eal? wie. <0 (64) 


Note that the case of a coherent field can be treated as the limit <n7> — 0 and 
that of a thermal field as the limit B — 0. 


6.2. MULTI-MODE CASE 


The normal generating function for the superposition of coherent and thermal 
fields can be written in the form@?1%3%) 


. Pe & _f -1 ix<n,4> 
(iy exp (xa) = I (1 — ix<n;,>)7' exp | a |. (65) 


Using the identities?” 


eet ixA oe (ixB)” x A 
c-uayew| a] = Lag wcme (-g) oo 


A 0 1 
=(1+B)-™ es Er Mets Pesca 
ae) exp | <3| Dare 


x(1 + 3) (1 ix LM} (- sam) (66b) 
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the following expressions can be obtained for the photon-counting distribution 
p(n) and its factorial moments; 


1 d* 
p(n) = al d(x) ( I exp (x) 


lix=-1 


_ _ «Ney» 
=[]G + arid)? exp |- i a 


1 ne <N4> 
pecs aaa? | ae (ee eag teen, Re 
% nex (: 7" as) 2( Gad + a a 


and 


d* 
WOO aes (I exp (x) . 


ix=0 


1 <Nea> 

=k! — (nz, Le (- ee). 68 
aria dl inst ray * Lin, Gas (68) 
The corresponding integrated intensity distribution can in principle be 
obtained from (44); 

ALM (W) J (— 1)* 


PAW) =w-! ¥ J 
j 


=0(¢+M-—)! » G—s)ie+ Ma iP (69) 


From these general results one can obtain, for example, the formulae for the 
detection of a thermal field using a heterodyne technique. In this method, the 
thermal field is superimposed on a known coherent component. Jakeman and 
Pike, who proposed this method, pointed out that additional spectral in- 
formation can be obtained, i.e. the central frequency of the thermal field can 
also be determined by this method. In general, the central frequency of the 
thermal field and the frequency of the coherent field are arbitrary and different 
from one another. We must therefore consider the superposition of coherent 
and thermal fields of different mean frequencies. The results can be obtained 
from equations (67), (68) and (69), taking 2 = 1, 2, ..., M for M modes with 
<n) = <Nz>/M, andi = M + 1,...,2M for M modes with <n7,> = 0. 

We may write 


= y | 3 (70) 


2M j=0 M 2M 
Ungan Lng=j Ungan—j 
A a A=M+ 


Using the polynomial theorem and an asymptotic expression for the L,,°- 
polynomials, together with the following identity for the Laguerre poly- 
nomials*® 


DETERMINATION OF THE STATISTICAL PROPERTIES OF LIGHT §29 


il 1 ] etn y 

——_—— _ Lit (x)= 7 j ( x) 
1 MA M J 

F macs a (m, aa)! (4+2a,+m-1)! é 

a A 


(71) 


we obtain®” 


<n) \-™ <ne{M + <nry(1 = °)] 
OE) mee 


. 1 


. (1 +5 ei (- a) (72) 


[<n,>(1 — w*)y"4 


<n «nz (Knz> + M) 
and 
wy Yoo" Yl — wR! 
4 fok-DIG+M— DI 
ny) 4 M-1 {n.>@’M 
oe ea: ) 


M 2M 
where > <n,,> = <n.) @” and Y <na> = <n-(1~o?). 
Aa A=Mt+1 


In these equations the parameter w = [2 sin(Q/2)]/Q, where Q is equal to the 
difference between the mean frequency of the thermal light and the frequency 
of the coherent light multiplied by the time interval T of observation. 

For Py (W) we obtain 


M (= <n.>(l — on“ 
nr» <n.» a? 
1 W + <n.» (2m? — 1) 
x exp | oo 

[<n.>(W — <n. (1 = or) 

«n> 

for W > <n.>(1 — w?) 

OforW < <n. (1 — w?). (74) 


Py(W) = 
x Typ (200M 
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The s-ordering formalism mentioned above gives us the more general 
formulae 


Re os k! l—s\* & I 2<nr)_\/ 
OG asain 2 ) Raemmlas ow) 


2 oe eee (enw) 


<n7> l—s 


x LjM-4 ( ) (75) 


and 


2W <n.» 
es 2 |- i—s <np/M+(1—s)/2 


2<n,<nr) (1 — @”) 
M(1 — s)[<n7>/M + (1 — s)/2] 


‘armen llasacas) ae 


. (= 1 o) | teas (4% er — oy) 


l—s 


(1 — s)w*M<n,> 


| 
ao}. 76 
2<np>[r>/M + (1 - oa) me 


x iat 


A number of special cases can be obtained putting a = 1,05 =1,0, —1 
(normal, symmetric and antinormal orderings), M = 1, etc. For example, for 
the case where the mean frequencies of the thermal and coherent fields are 
identical, ic. @ = 1, we have!® 


i k! <ny> D8 -1f _ se | 
W.= gop t a) 4 (-a ae . 
and mp) 


(<n.>W)* 
<n7p>/M + (1 — s)/2 


Po. (2 ) (78) 
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The case s = — 1 would be useful for determining the statistical properties 
of light using “quantum counters”? 
More general results can be gained from the s-ordered generating function 


<exp ix), = T] E ae (<0. me = 


ix «Nea? 


1 —ix(n;,) + = me 


x exp | 


We obtain 


Oe he <n4) 


& ne Cent ht 1) (ee 
re eal milo? +5" | 18 (-G Para) @O 


The general integrated intensity distribution can be obtained from (W*),; 


P(W, s) = W™-* exp(—W) Yc, L,™-* (W), (81) 
n=0 
_ n} ¥ (- 1 j 
where c= (iM — Di 2sjl@o D+ MoD! CW"), (82) 
with the condition that 


[m+ M — 1)! 


= lol2< 00. (83) 


[?207,9) Wt exp (yaw = y 
0 n=0 
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Particle Beam Fluctuations in Quantum Mechanics 


C. M. BENARD 


Photon beam fluctuations have been studied by many authors. Information 
about the statistical properties defining a photon beam may be obtained by 
coincidence and counting experiments and the properties are found to vary 
from those of Gaussian light to those of Jaser light. We know how to explain 
these results for bosons in quantum optics by using a theory which enables us 
to describe both the field and the detector acting on the field. The results are 
well known and we shall now ask what statistical properties we might expect 
to observe if a counting or a coincidence experiment is performed on a beam 
of fermions. Can the statistical properties of fermion beams vary over a range 
comparable with that of photons from Gaussian or other sources? This ques- 
tion is very broad and we shall not provide a complete answer. The point we 
wish to stress here is the connection between the various statistical properties 
that may be detected in a system of particles and the quantum nature of the 
particles. 

We shall define the statistical properties of a beam of particles in terms of its 
measured coincidence probabilities. The coincidence probability P,(a,, t,, ..., 
a,, tp) is the probability that any particle will be detected at point a, at time 
ty,..., any other at point a, at time t,, where p is any positive integer. This 
choice of the coincidence probability is determined by two considerations. The 
first is that the set of the coincidence probabilities should completely define the 
statistical properties of the beam (including their time dependence), and the 
second is that they may be measured directly. 

In order to describe the system of particles and their detection, and to cal- 
culate these probabilities, we shall use the wave packet formalism. This formal- 
ism enables us to describe and compare the behaviour of boson and fermion 
beams in coincidence experiments. The formalism was first used by Goldberger 
and Watson) to calculate the second order coincidence probability for a 
particle beam, but they used a normalization which is not strictly valid. 


1. THe WAVE PACKET FORMALISM 


Consider a system of indistinguishable and non-interacting particles. The 
number of particles, N(@), is a random variable. A stochastic wave packet 
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®; (r, t, w;') is associated with every particle of the beam located at r at time f. 
The choice of a value of the random variable «,’ corresponds to the choice of 
@; among a given set of functions (depending on the beam). All the ®, are 
identical stochastic functions. We consider the set of particles as resulting from 
the superposition of states (w, «’), where the number of particles and the wave 
packets are given. Each of these states is described by a given wave function, 
y. Since the particles do not interact, we can calculate w from the functions ®,. 
The beam is described by the stochastic function W[N(@), 1, ...5 Ty(ay> ty O"] 
depending on the random variable w and on the set of random variables 
wo’ = {q,’}. All these variables may, or may not, be independent. 


2. IDEAL DETECTORS 


Consider an ideal two-level atom used as a photon detector (see chapter by 
Glauber in this volume). This detector is assumed to act in a volume, and with 
a response time very much smaller than the volumes and times characterizing 
the field. Such a detector placed in the field tells us about the presence, or 
otherwise, of a particle at point a, at time 1, i.e. it projects the system at time ¢ 
from the state y, which it may occupy before the measurement, into a subspace 
&(a, t) of the space describing the states of the system. &(a, t) contains all the 
states of the system for which we know that a particle is at point a at time f. 
The probability for such a projection to occur is given by the first order coinci- 
dence probability. We can make a measurement by using p detectors located 
at points a, ..., a, at times ¢,, ..., f,; each detector acts as a projection opera- 
tor in a sub-space &(q;, t;). The probability for detection of a particle by every 
detector is the pth order coincidence function P,[ay, t,, ...,@,,t,]. For ferm- 
ions, ideal detectors based on secondary emission, for instance, may be 
devised. For bosons and fermions, then, it is theoretically possible to locate a 
particle in space and time and so to measure coincidence probabilities. 


3. GENERAL FORMULATION OF COINCIDENCE PROBABILITIES 


3.1. COINCIDENCE PROBABILITIES AT A GIVEN TIME 


Consider a system consisting of one particle described by a stochastic wave 
function ®(r, t, w’). The ensemble average <|®(a, t)|?>t gives the probability 
of detecting the particle at point a, at time t, where ® is normalized on a certain 
volume V where the particle is known to be. 

If there are N(w) particles inside V, the probability of finding a particle at a,, 
any other at a2, the Nth at ay, at time ¢, while the system is in state (@, w’) is 


{The symbol ¢ _ 5 indicates an ensemble average. 
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Py ,(«,«') (41) «++» Ay, t) and is such that 


| Pye) Lis + Ty tl dr, ... dry = 1, (1) 
ry Sr2S...Sry, 
where rj, ..., fy(V). In performing this integration, we are considering all the 
possible different N coincidence measurements that we can make on this set of 
particles. We know that by these measurements, we are sure to detect the N 
particles of the system, this justifies eqn (1). 
Since the normalization condition on w may be written 


i IW}? dr, ...dry = 1, (2) 


it is easy to show that 
Py (o,0') [a;, ..., dy, t] = N(@)! |WLN(o), ay, ... dy, t, | a (3) 


However, we are interested in more general coincidence probabilities, viz. the 
probabilities of detecting p particles, without knowing the total number of 
particles. To determine these probabilities, we just calculate the conditional 
probabilities of detecting p particles at a,, ..., a, at time t, knowing that there 
are N(@) particles in the system. The probability we are concerned with is an 
a priori probability obtained from the conditional probability by taking the 
mean value over w and w’. We obtain 


P 
P,(a;, ..-» A, t) = (ff fy|? y T] 6(@:—rp,) dry ... ary) (4) 


Bi #f2#...#Bp i=1 


(B,, ..., B,) are permutations of p elements taken in N elements. 


3.2. COINCIDENCE PROBABILITIES AT DIFFERENT TIMES 


Using the Dirac formalism, the wave function y is written as a vector |W) 
of the state space. We can describe the action of p detectors acting at times 
ti, < t,, < ..., < f, by an operator. This operator is the product of the pro- 
jection operators on &(a,, t;), P(a;, t,). The probability may be written 


(WIP Gr, t1) «.- Py, ty). PA, t)IW)>. 


In the general case considered so far, this expression is quite complicated but, 
in the stationary incoherent case described below, it can be shown that such a 
probability may be obtained formally from the coincidence probability, 
P,(41, 42, ..., 4p, t). In fact P,(a, ...,a,,t) may be described completely in 
terms of the function ®,(q,, t, w;’) and P,(a,, t,; ...;a,¢,) may be written in 
exactly the same way only by changing 9,(q,, t, @,') into ®,(a,, t,, @,’). So, in 
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the following we shall only consider eqn (4). Any other coincidence probability 
can be easily obtained from the results we get. 


4. COINCIDENCE PROBABILITY IN THE STATIONARY INCOHERENT CASE 


To calculate P,(a,, t1, ..., dp, t,) given by eqn (4) we have first to expand y 
in terms of ®. Since y is normalized (eqn (2)), it is written as 


N 
w= LPy*(+1)* I] ®,,(r;, t, o') 


x wef EPy®(EDE I ®p, (r,t, 0’) 
Vv Vv i=1 


N = neck b 

x XPy*"(41)*" [] ®,,* (7; t,o) dry ... ary | ‘| (5) 
i=1 

Py* is the sum of all the permutations of order k, of a, ..., dy. 

If the functions ©, were orthogonal, the denominator of eqn (5) would re- 
duce to ./N!. Goldberger and Watson assumed that this was the case, but it 
can be shown that this is valid only in certain cases. We cannot assume, a priori, 
that the functions ®,; are orthogonal for all types of beam. On the contrary, 
the ®;, being stochastic identical functions, it can be shown that it is impossible 
for them to be orthogonal for every set of <@;’> = w’. Therefore, P, must be 
written in a general way thus, 


N 
Pe = (If ‘ 5 . I ®,(r;, t, o') X Py*(+1* ®,,* (r;, t,o) 


ce ll 6(a;—rg,) dr, ... ary] 


Bit. #Bp j= 


x iste [1 (ry t,o’) E Py (EK 


i=1 


-1 
x O,,* (7; t, 0’) dry ... drs] , (6) 


This formal result is too general to tell us much about the properties of Poe 

For the case of a completely incoherent stationary beam, it can be shown 
that the denominator, D, of eqn (6) cancels with factors in the numerator of 
(6). The probabilities P, are then rather simple and several remarks can be 
made about them. We define a completely incoherent beam as follows: 
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a. the ®, are identical and stochastically independent; 
b. the stochastic laws of N(q) and of the ®, are independent; 


c. the beam is stationary in time and space (in particular we consider such 
an extension of the beam that the spreading of the wave packets may be neg- 
lected. 


To study this stationary incoherent system, we shall consider the two dif- 
ferent cases which cover the few experimental situations. 


1, The number of particles, 5, passing through the coherence area during the 
coherence time 7 is very much less than 1, This condition is satisfied by mono- 
chromatic natural light (6 ~ 107 *) and by the most monoenergetic and power- 
ful electron beams we can produce (6 ~ 107 *~107°). In this case, D ~ 1. 


2. When 6 is not small compared to 1, the total mean number of particles, 
<N>, is quite large. By using the law for large numbers, we can show that the 
factors appearing in the numerator cancel with D, in (6). Then D > 1, and the 
calculation of Goldberger and Watson is not valid. 

In both cases (1 and 2), we obtain the same expression for P,, 


Pp 
Pi (Gy, ty «0+ ap» ty) = wy I] ®(4;, t;, @;') 


N-1 Pp 
x >» P,X(+ 1)* I] ®,,* (a;, tis wi) . (7) 
K=0 i=1 


If we set 
n,(a, t, @,') = 0 (a, f, @,;') <(®|7>74, 


we can rewrite P,, in the form 
Pp Pp 
iy = or I n(@;, ti, @,') rP,*(+ 1)* [1 1a.* (a;, tis 0) 
I= i= 


where p is the mean density of particles in the system. 


5. DISCUSSION 


a. We first note that in the range of the possible beams considered (non- 
interacting particles), the statistical properties do not depend on the strength 
of the beam. 


b. Putting p = 2 in eqn (7), we get, (see Fig. 1 for |y,2| = exp (—2|t,— 
ty|/t)) 


P2(a4, ty, @z, t2) = p?{1 +ly1217] (8) 


540 C. M. BENARD 
where 


Y12 = <n(ay, ty) *(@, t2)> = y24*. 


Equation (8) gives the well-known bunching effect for bosons (+ sign). For 
fermions (— sign), it shows an “antibunching effect”. In the latter case, the 
conditional probability of detecting a particle at a given point at a given time 
when we have already detected another particle at a neighbouring point and 
time (i.e. within the coherence volume and coherence time) is smaller than the 
a priori probability of detecting a particle within the coherence volume during 
the coherence time. This second order effect could have been predicted easily 
from our knowledge of fermions. The interest in our result lies in its applica- 
tion to higher order coincidence probabilities. 


Bosons 


Fermions 


Se ee oes Dens Ee beess ree Ss Sew er aes he ee ee ec CO ale 


05 1-0 2-0 
t/t 


Fic. 1 


c. Equation (7) shows that both the bunching effect and the antibunching 
effect occurring in an incoherent beam of particles come from the terms cor- 
responding to non-zero order permutation. This means that, if the particles 
were distinguishable, the P, would merely define a Poisson process. But, 
because of their indistinguishability, the particles which are assumed to be 
stochastically independent when emitted, can only be detected as bunched or 


antibunched. 


d. All the well-known results for bosons are obtainable from eqn (7) when 
the positive sign is taken. The P, obtained are the coincidence probabilities of 
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a compound Poisson process. They are equal to the moments of a stochastic 
real positive function 7 (r, t) given by 

TJ (r,t) = E(r, t) E*(r, t), (9) 
where E(r, t) is a Gaussian stochastic function whose correlation function is 
given by 

(E(a,, t,) E*(az, t2)> = py 12. 
It can be considered as the field associated with the particles. 
e. Fermions have several interesting features. 


1, Consider the one-dimensional stochastic process which is a function of 
time only, obtained by making all the coincidence measurements at the same 
point in space. Assuming that the correlation function 


V12 = <n@, 41) n*(@, t2)> 
is such that 


¥13 =V1223 for t; <t, < tz, (10) 


the process is a renewal process. It means that the waiting time between two 
detected particles does not depend on the particles detected previously. This 
result depends on the assumption that the particles do not interact. The prob- 
ability for the waiting time to be equal to ¢, if [y,.| = exp (—2|t, —25|/r) is 
given by (see Fig. 2) 


t _ 2pexp (t/t) sinh [(t/t) (1—2p7)*] 
aia aaa 


(11) 
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From this it follows that 2pt < 1; this is simply an expression of the Pauli 
exclusion principle. 


2. We may also study the statistical properties of the number of particles that 
can be detected at a given point during a time, T. The probability P,(T) of 
detecting n particles during time T can be obtained from the set of functions 
P,. We see that for n > 2 and T <t, we have P,(T) < 0. Again, this is the 
Pauli exclusion principle. For T > 1 (i.e. T > 3t or 4t) we shall consider three 
stochastic processes, the mean intensities of which are the same. We shall 
assume that one is a Poisson process, the second is a Gaussian Poisson com- 
pound process, and the third is the detected process of a stationary incoherent 
beam of fermions defined by the same second order correlation function y,, 
as the compound Poisson process. Then we have 


P,p(T) = 2P,p(T) —P,a(T), (12) 


where P,,z refers to the Poisson compound process, P,,p to the Poisson process, 
and P,, to the fermion process. This symmetry between fermions and bosons 
is only an approximate one, that is allowable for T > 3r or 4t (then the rela- 
tive error is ~ 107%). The symmetry is completely true only for the second 
order statistical properties of the beam, viz. the second order coincidence 
probability, or the variance of the number n detected during T, for instance 
(see Figs. 3 and 4). 


3. We shall now consider the possibility of performing experiments to observe 
these statistical properties of a beam of fermions. Electrons appear to offer the 
most favourable conditions, with the power and monochromaticity of the 
sources now available. But these sources, if they give a coherence area large 
enough to allow good measurements, give a coherence time of the order of 
10~ ** sec. The fastest electron detectors available are generally 3 or 4 orders 
of magnitude slower than this, so they would completely smooth the effects 
we should like to observe. This situation is similar to that which confronts us 
when we wish to perform correlation or coincidence experiments on white 
light. 

The fermion problem we have considered seems to be beyond the reach of 
experimental confirmation at present. 


6. CONCLUSIONS 


The formalism we have used has shown in a very simple way how the bunch- 
ing or antibunching effect comes from the detection of indistinguishable parti- 
cles. This effect is very important when the particles are stochastically inde- 
pendent. It can be cancelled out, for bosons, if certain correlations are assumed 
between the particles (for example, as in the laser). 
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We can also analyse completely the incoherent stationary fermion-beam 
and stress several interesting characteristic features of it. It can be shown that 
this incoherent stationary case corresponds to thermal equilibrium. 

Now, it would be interesting, from a theoretical point of view, to have a 
broader view of the question, that is, to analyse how the statistical properties 
of a fermion-beam vary according to the different sources that could be con- 
sidered. In particular, the case of fermion-beams not in thermal equilibrium 
should also be studied. 
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90, 97, 117, 134, 183, 243, 246, 270, 
293, 485, 491 
of cavity mode, 252 
for electrons, 247, 252, 269 
infinite series in, 105 
for photon packet, summation of, 117 
polynomial in, 105 
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Cylindrical coordinates, transformation 
to spheroidal coordinates, 335 


D 


Damping, 433 
constants, 294, 315 
quantum theory of, 184-191 
Degeneracy parameter, 2 
Degenerate secular perturbation theory, 
135 
Delayed coincidence method, 144 
6-functions, 192, 195, 520, 525 
6-function structure, 113 
Density matrix, 136, 270, 355, 358, 515 
elements, 134 
single time, 267-269 
zero order, 360 
Density matrix equations, 263-267, 276, 
311 
derivation of, 264 
non-Markoffian, 297-298 
specialization of: 
for atom, 266 
for field mode, 267 
for light mode, 265 
Density operator, 67, 73, 75, 87, 91, 106, 
120, 134, 271, 272, 508, 525 
chaotic, 115, 122, 125 
equation of motion, 178-180 
expressed in terms of coherent states, 
89 
expressions for, 123 
master equation for, 494 
reduced, 134, 184-188, 197 
in the Schrédinger picture, 179, 188 
Depolarization, 441 
Dielectric constant: 
intrinsic tempreature dependence of, 
380 
local, 162 
Dielectric energy, orientation-dependent 
part, 374 
Diffraction grating as resonator mirror, 
345 
Diffraction patterns in the one-photon 
approximation, 58 
Diffusion coefficients, 196, 287, 291, 292 
Diffusion constants, 283, 287 
Diffusion matrix, 196 
Dipole approximation, 30 
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Dipole moment, 223, 228, 229 
operators, 251 
Dirac’s delta function, 14, 28, 92, see 
also 6-functions 
Dirac—Fourier integral representation, 
100 
Dirac notation, 13 
Dispersion, 479 
Displacement operator, 116 
Distribution functions, 282, 288, 298 
calculation of multitime correlation 
functions by, 278 
characteristic function for electrons, 
274 
for electrons and fields, 274 
equation for, 299-301, 310 
equation for, for set of two-level 
atoms, 275-278 
for the field, 270-273 
generalized, 279 
for a single electron, 273 
Doppler broadening, 110 
Doppler experiment, 164 
Doppler spectroscopy, 165-173 
of turbulent flow, 170, 171 
Double resonance experiments, 453-457, 
see also Brossel—Bitter experiment 
background signal, 406 
broad-band irradiation, 453 
compared with level-crossing experi- 
ments, 442-443 
disadvantages, 442 
light emission in, 455 
model of, 454 . 
modulation in, 457-459 
steady state, 453 
to study hyperfine structure, 402 
Double resonance line width, 401 
Double resonance method, 401—407 
application, 443 
Driven damped oscillator, 188 
Driving fields, 294 
Dyadic product of state vectors, 67, 91 


E 


Eigenvalues, 13 
continuous, 14 
integral over, 15 
discrete: 
sum over, 15 
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Eigenvectors, 13 
normalized, 14 
Electric dipole approximation, 65, 133, 
355 
Electric field, 31, 219 
strength of, 219 
Electric polarization, macroscopic, 356 
Electrocaloric effect, 380, 381 
Electrodeless lamps, 436, 437 
Electrodynamics: 
classical, 22-26 
quantum, 11-52, 57 
connection with optics, 53 
semiclassical, 26-30 
Electromagnetic field: 
intensity, 61 
mode decomposition of, 30-33, 62 
for infinite volume case, 40 
mode density in, 54 
quantization of, 30-42 
quantized: 
interaction with classical current, 
36, 37 
interaction with electron system, 
202 
total Hamiltonian of, 33 
vector potential, 246 
within a finite volume, 54, 59 
Electromagnetic spectrum, 53 
y-ray, 53 
a single unit, 54 
visible range, 53 
Electromagnetic waves: 
nonlinear coupling between, 363-365 
coupling coefficient, 364 
Electron density, fluctuation due to plas- 
ma waves, 385 
Electron excitation, 440 
difficulties, 440 
in level-crossing experiments, 441 
Electronic dipole moment, 134 
Electronic transitions, in laser atom, 492 
Electrons: 
classical radius, 49 
collision with visible quanta, 54 
ground state, 47 
interaction with electromagnetic field, 
11, 133, 135 
interaction with photons, 11 
possible states of bound, 133 
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Electrostatic potential energy, 26 
Electrostrictive effect, 378 
Emission, 138, 139 
probability, 138 
spontaneous, see Spontaneous emis- 
sion 
Energy-conservation condition, 49 
Energy flux vector, see Poynting vector 
Energy levels, unequally spaced, 468 
Ensemble average, 67, 536 
Entire functions, 90 
Entropy, defined, 106 
Equations of motion: 
for classical Brownian particle, 284, 
285 
density operator, 178-180, 510 
for distribution study, 194-196 
Hamiltonian, 246-248 
Heisenberg, 178, 184, 246-252 
for plasma coordinate, 386 
for rotating field, 449 
Schrédinger, 177, 182 
Expectation values, 143, 193 
evaluation of, 92 
Exponential functions of operators, 
multiplying, 97 


F 


Fabry—Pérot cavity, 324-331 
frequencies of oscillation modes, 325 
mirrors, 324 

Fabry—Pérot etalons, 5, 6, 166 

Fabry-Pérot oscillogram, 165 

. Fabry-Pérot resonator, 326 
arrangement of mirrors, 327 
with non-parallel end mirrors, 331 
phase shift, 328, 329 
plane parallel, 345 
power loss, 329 
saturable gain, 330 
Wainstein’s theory, 330, 331 

Factorial moments: 
for coherent source, 151 
for incoherent source, 151 
in the heterodyne case, 162 

Fermi’s golden rule, 135 

Fermions, 123 
antibunching effect for, 540 
chaotic density operator for, 125 
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counting experients on beam of, 535 
features of, 541-542 
maximum entropy states for, 124 
occupation numbers, 124 
possible experiments to observe stat- 
istical properties of, 542 
Feynman’s disentangling techniques, 
271 
Field, counter-rotating, 473 
Field-dependent polarizability, 7 
Field eigenvalue, 95 
Field equations: 
semiclassical, 219 
quantum, 202, 212, 213 
Field intensity, 68 
Field, oscillating, 473-476 
Hanle effect with, 474 
Field oscillating perpendicular to static 
field, 470-473 
Field oscillating sinusoidally, 464 
degeneracy, 466 
frequency diagram, 466 
interaction Hamiltonian, 464 
periodic terms in expansion, 465 
Field, rotating, 447-454, 473 
diagonalization of transformed Ham- 
iltonian, 449 — 
energy levels, 452, 453 
equation of motion, 449 
frequency diagram, 451 
interacting with unequally spaced 
levels, 468 
Field theory: 
quantized, 448 
semiclassical, 447 
Fine structure constant, 30 
Flat-roof resonator, 345 
convex, 348, 349 
field distributions, 350 
intensity distribution for modes, 347 
power-loss per transit in, 348, 351 
as section of diamond cavity, 346 
spot size, 348 
zeroth order mode patterns, 349 
Flow lamp, 438 
Fluctuating forces, 294, 315 
Fluctuation spectrum, 197, 198, 200 
Fock elements, 526 
Fock state, 522 
Fokker-—Planck differential operator, 512 
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Fokker—Planck equation, 4, 5, 286, 287, 
495, 496, 507 
classical derivation, 284 
elimination of atomic variables, 282, 
283 
exact generalized, 298-313 
derivation, 301-303 
evaluation, 308-311 
relations for derivatives, 303-307 
expansion, 281 
generalized, 270-278 
Green’s function solution, 198, 200 
for multimode laser action near 
threshold, 291-293 
for non-commuting macroscopic vari- 
ables of ground-state laser, 245 
for particle moving in two dimensions 
under friction force, 287 
reduction of generalized, 281-288 
for rotating wave van der Pol oscill- 
ator, 197 
for rotational Brownian motion, 374 
solution of reduced: 
non-stationary, 289 
steady state, 288, 290, 291 
Forced harmonic oscillator, 12 
Forced oscillator, 20-22 
Hamiltonian for, 20 
Forward scattering, 477, 480 
Fourier coefficients, 60, 62 
Frequency pulling, 234, 236 
Frequency pushing, 236, 242 
Frequency shifts, 430-433 
arising from collisions, 430 
arising from irradiation of atoms, 430 
light shifts, 430, 431, 432 
semiclassical interpretation, 432 
pressure shifts, 430, 431 
Fresnel number, 328, 333, 344 


G 


g-values of atoms: 

of excited states, 445 

precision measurement of, 421, 444 
g-value of spin system, 476 
y-ray beam, monochromatic, 54 
Gauge condition, 23 
Gauge transformation, 23 
Gaussian—Lorentzian field, photon stat- 

istics of, 160 
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Generating function, 142, 161 
Geometrical optics approximation, 351 
Green’s functions, 279, 289 

Green’s theorem, 25, 32 

Ground state as a reservoir, 208 


H 


Haken—Sauermann treatment, 237 
equivalence with Lamb formulation, 
239 
Hamilton’s equations, 11, 12, 25 
Hamiltonian equation of motion for 
loss-less system, 246-248 
for interaction between laser modes 
and atomic system, 247 
Hamiltonian formalism, classical, 11-22 
Hamiltonian function, 11, 14, 25 
classical, 42 
for forced harmonic oscillator, 12 
quantum mechanical, 42 
Hamiltonian operator, 26 
for an electron in an atom, 29 
Hanbury-Brown-Twiss_ effect (HBT 
effect), 85, 108, 112, 115, 120, 128, 
130, 498, 499 
explanations, 114 
not found for purely coherent field, 113 
not found with stabilized laser beam, 
85 
Hanbury-Brown-Twiss experiment, 56, 
58, 84, 85, 128, 497, 499 
results of, 84 
Hanle effect, 434, 473 
Harmonic oscillator, 32 
coordinate operator, 270 
effect of external force, 22 
energy eigenstates of, 17 
forced, 12 
forcing term in, 36 
“idler’”’, 489 
momentum operator, 270 
quantum mechanical, 15-17 
“‘signal’’, 489 
Harmonic polarization, 360 
Harmonic production, 77 
Heatbath, 491 
effect of, 313-316 
Hamiltonian of, 263 
Markoffian, 251 
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Heavy particle excitation, 441 
Heisenberg equation of motion, 178, 
184, 314 
including damping, 249-252 
for loss-less system, 246-248 
Heisenberg operator, 178 
Heisenberg picture, displacement of op- 
erators in time, 73 
Helmholtz coils, 439 
Hermite—Gaussian eigenfunctions, 334, 
337 
Hermitian conjugate, 13 
Hilbert space, operator in, 67, 509 
Hole burning into the gain curve, 230, 
232, 237 
Hollow cathode lamps, 436, 437 
Huygens principle, 79, 327, 329 
Hydrogen: 
hyperfine structure, 443 
level-crossing experiments on, 444 


I 


Ideal detector, 132, 143, 536 
based on secondary emission, 536 
Identity operator, 183 
Incoherent beam, defined, 538, 539 
Incoherent processes, 207 
Incoherent source, normalized factorial 
moments for, 151 
Incoherent system, stationary, 539 
Induced absorption, 45, 46 
rates, 213 
Induced emission, 45, 46 
rates, 213 
Induced transitions, semiclassical treat- 
ment, 29-30 
Infinite volume limit, 39 
Instrumental effects, 156 
Integrated intensity distribution, 514 
determination from photon counting 
distribution, 518-522 
using analytical generating function, 
518, 519 
by method of orthogonal polynomi- 
als, 519-522 
form of, 523-524 
general, 531 
Intensity spectrum, 197 
Intensity-correlation experiment, 2 
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Intensity fluctuations of classical light 
source, 127 
early measurements of, 128 
normalized autocorrelation function 
of, 129 
showing Bose character, 149 
Intensity fluctuation in laser, 258, 262, 
263 
Intensity fluctuation distribution, 142 
Intensity-fluctuation spectroscopy, 165- 
173 
limits of, 166 
in study of transport parameters of 
CO,, 168, 169 
in study of turbulence, 162, 169, 170, 
171 
Intensity-fluctuation spectrum, 143, 149 
of a mixed field, 161 
of single-mode gas laser, 157 
Intensity measurements, 65-78 
Interaction energy, 188, 433 
Interaction Hamiltonian, 355 
Interaction picture, 191 
reduced system density operator, 185 
Interference between light from different 
atoms, 477 
Interference experiments, 76 
Interference filters, 439 
Interferometric evidence, 2 
Inversion symmetry, 365 
Ionization cross section, 49 
Ions, paramagnetic, application of opti- 
cal methods to, 446 
Iteration procedures, 229 


J 


Josephson junction oscillators, 177 


K 


Kerr cell, 211 
Ket vectors, 133 
Kinetic energy of particle, 26 
Kleinman symmetry relation, 363, 368 
in BaTiO;3, 364 
in KDP, 464 
in quartz, 364 
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Ladder operators, 33 
Laguerre polynomials, 519, 520, 528 
Laguerre—Gaussian eigenfunctions, 334, 
337 
Lamb shift in hydrogen-like atoms, 51 
Lamb treatment, 238 
equivalence with Haken-—Sauermann 
formulation, 239 
Langevin equations, 249, 252, 253, 283 
for laser, 287 
for motion of Brownian particle, 285 
quantum mechanical, 294-297, 
313-314 
Laser, 4, 7, 53, 56, 114, 128, 130, 153, 177 
axial modes, 217-218 
emitted coherent power, 215 
frequencies, 232, 234, 371 
gas, 155, 156, 159, 240-242 
equations, for, 237-240 
intensity-fluctuation spectrum of, 
157 
giant pulse, 210-212 
homogeneously broadened, 
inhomogeneities in, 216 
intense photon beams from, 57 
linear range, 255 
as “‘local oscillator’’, 160 
losses, 205, 212 
mode pattern with Brewster angle win- 
dows, 337 
mode picture, 222 
multimode operation, 234 
net gain of stimulated process, 205 
non-adiabatic effect in, 507-512 
a non-linear device, 59, 114 
non-linear range, 257 
at high inversion, 258 
ruby, 356 
single-mode operation, 
240-242, 253-263 
solid state: 
with homogeneously broadened 
line, 232-235 
with inhomogeneously broadened 
line, 235-237 
spatial effects, 212 
steady state, 214 
‘“‘super-radiance” phenomenon in, 5, 
505 


spatial 


230, 232, 
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threshold value of pump in, 323 
two-mode operation, 231 
wave picture, 219 
Laser action: 
condition for, 206, 216 
multiple mode, 234 
single mode, 232-234 
Laser equations, 206, 207 
for frequency shift, 234, 235, 237, 241 
for gas laser, 237-240 
heuristic derivation, 219 
linewidth formula, 260 
master equation, 243 
for mode interacting with two-level 
atoms, 221 
multimode, 224, 228-229 
for photon densities, 234, 241 
rate equations, 212 
semiclassical, 253 
density matrix, 253 
fundamental drawback of, 242 
for two-level atoms, 224 
single mode, 227, 230, 254 
stationary solution, 207, 226-232 
time-dependent solutions, 209 
Laser field mode, non-Markoffian eff- 
ects, 512 
Laser light, 149, 201, 513 
coherence properties of, 5 
nature of, 4 
scattering by 
162-165 
statistical properties of, 243 


Laser-model, 219 
Laser theory, 4, 131, 201-321, 524 
expectations of, 202 
quantum, 244 
quantum-mechanical treatment, 242, 
293 
need for, 242 
nonlinear, 243 
semiclassical approach, 219-242 
structure, 201-203 
Laser transients, 159 
Laser transitions, upper, 209 
Lateral modes, separation, 325 
Lens, graphical construction for pass- 
age of beam through, 340 


Leucht-electron, 246 


random medium, 
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Level-crossing effects, 406, 407-413 
arrangement for studying, 408 
determination of lifetime, 413 
experimental results, 408 
in ground states, 433-434 
interpretation of, 410 
in mercury, 467 


Level-crossing experiments: 
compared with double-resonance ex- 
periments, 442 
on hydrogen, 444 
on manganese, 445 
with modulated light, 443 
Level shift, 433 
Light: 
circularly polarized, 363 
interaction with atoms, 395, 477 
rectification, 361, 362 
statistical properties of, 513~532 
thermal, 523 
Light absorption, 135 
Light beam: 
fluctuating, statistical properties of, 
513 
steady state, 72 


Light field: 
average amplitude, 266 
coherence properties of, 484 
description, 131-138 
detection, 138-146 
Langevin forces for, 313-316 
Light pulse propagation: 
effect of medium, 359 
through two-level system, 356-359 
Light sources for optical pumping, 
436-439 
alkali metal lamps, 437 
electrodeless lamps, 436, 437 
flow lamps, 438 
hollow cathode lamps, 436, 437 
requirements, 436 
Light waves, coupling with other excita- 
tions, 385-387 
Line broadening: 
homogeneous, 215, 216, 220 
inhomogeneous, 253, 358, 510 
**Line narrowing’, 157 
Linear operators, 311, 313 
Liouville operator, 509, 510 
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Lorentz force equation, 22 
derivation from Hamiltonian func- 
tion, 25 
Lorentz transformation, 54 
Lowering operator, 17 
Lyot filters, 440 


M 


Macroscopic variables, 312 
Magnetic field, 31 
Magnetic resonance, 448 
Magnetic resonance transition, 396, 398 
amplitude of r.f. field to induce, 399 
Magnetometry, 421 
Manley-Rowe relations, 364, 498 
Many-photon problems, 57 
Markoff approximation, 184, 187, 286, 
507 
objections to, 508 
Markoff process, 284 
Markoffian heatbaths, 251 
Markoffian property, 296, 508 
Maser, 3, 4, 6 
Master equations, 136, 146, 187, 188 
derivation of, 137 
laser, 243 
for time development, 153 
Material Hamiltonian, 355 
Matter equations: 
quantum, 202, 207, 213 
semiclassical, 222 
Maxwell’s equations, 1, 22, 24, 35, 88, 
219 
for Brillouin Stokes components, 378 
Mean occupation numbers as function 
of frequency or wave number, 122 
Mercury discharge lamp, 166 
photon coincidences, 167 
Metastability exchange, 425, 430 | 
Mode functions, 38, 59, 60, 86, 88 | 
non-monochromatic, 86 
normalization, 32 
Mode index, 59 
Mode selection theorem, 215, 216 | 
Mode volume, 134 
Modulation: 
in absorption, 459 
from metastable state, 460 | 
in fluorescent light, 457, 461 
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Molecular level-crossing spectroscopy, 
445 

Molecules, application of optical meth- 
ods to, 445 

Moment sequence, 523 

Momentum mismatch, 376 

Monitoring lamp, 430 

Monochromatic fields, 447 

semiclassical treatment, 448-453 
application to double resonance 

experiments, 452-457 

Monochromaticity, relationship with 
optical coherence, 82 

Motion, equations of, see Equations of 
Motion 

Multidetector correlation experiments, 5 

Multimode resonator, theory of, 325 

Multiple counting statistics, 143 

Multiple scattering, 441, 480 


N 


Narrabri steller interferometer, 167 
Nonlinear effects in solids, theory of, 7 
Nonlinear medium, 330 
Nonlinear optics, semiclassical treat- 
ment of, 356 
Nonlinear phenomena in condensed 
matter, 355-393 
Nonlinear susceptibility, 368, 369 
lowest-order, 359-363 
tensor, 360 
Normal modes, 180, 181 
Normal ordering operation, 516 
Normalization condition, 64 
Normalization function, 118 
structure of, 119 
Normally ordered products, 87 
Nuclear spins, measurement of, 417 
Number operator, 515 
Nutation, 429, 403, 446 
Nutational frequencies, 449, 452 
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Occupation probability, 136 
One-photon state, 40, 116 
calculation of diffraction patterns,58 
normalization condition 116 


SUBJECT INDEX 


Open resonator, 341 
with spherical mirrors, 342 
Operator, see under separate headings 
Operator equations, proofs of, 19 
Optical coherence, 81, 131, see also 
Coherence 
condition for, 81 
relationship with monochromaticity, 
81 
Optical detectors, 2 
Optical experiments: 
classical, 58 
necessity for statistical theories, 133 
Optical fields distinguished from radio- 
frequency fields, 446 
Optical-field correlations, 131 
Optical pumping experiments, 8, see 
also Pumping 
detectors, 439 
filters, 439 
lamp spectrum for, 440 
light sources, see Light sources for 
optical pumping 
magnetic fields, 438 
modulation in, 457-459 
polarizers, 440 
sample containers, 434-435 
transfer of coherence in, 461 
Optical resonators, 323-354, see also 
Resonance 
confocal, 331-334, 343 
diffraction at mirror edges, 338 
Fabry-Perot, 324-331 
high-loss, 343, 344 
low-loss, 342, 343 
non-confocal, 334-340 
possible geometries, 344 
roof-mirror, 344, 351 
with spherical mirrors: 
stable configuration, 339 
unstable configuration, 339 
stable, 343, 344 
stability diagram, 343 
unstable, 343, 344 
Optics: 
nonlinear, 59 
renaissance in, 8 
Ordered operators, 191-194 
Orientation relaxation time, 374 
Origins of quantum optics, 1 
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Orotron 345 

Orthonormality condition, 13, 27, 60 
Orthonormality relation, 32, 36 
Oscillator, quantum mechanical, 6 


P 


P-representation, 92-107, 148, 510 
integral of P over complex plane, 92 
P not a probability density, 93 
P a quasi-probability density, 94, 96 
question of construction, 101 

Packet amplitude, 124 
squared, as function of frequency or 

wave number, 122 
Pair creation and annihilation, 51 
Parametric amplification in a nonlinear 
dielectric, 501 

Parametric amplifier, 6, 75, 95, 489 
optical, 7, 177 
theory, 364 

Parametric approximation, 496 

Parametric conversion, high efficiency, 

491 

Parametric emission: 
spontaneous, 493, 495 
stimulated, 493 

Parametric oscillation, 489 
degenerate, 489 
noise, 489, 490 
quantum statistics of, 489-500 
statistical properties of, 499 

Parametric oscillator, 489 
generation of signal and idler modes 

in, 490 
scheme of, 490 

Parametric processes, phase-dependent, 7 

Particle beam: 
fluctuations, 535-544 
statistical properties, 535 

Pauli exclusion principle, 542 

Periodic boundary conditions, 38, 59 

Periodic perturbation, response of mat- 

erial to, 360 

Periodic systems, 341-345 

Perturbations, 467-477 
time-dependent, 455 

Perturbation theory, 43, 136 
n-th order, 70 
time-dependent form, 26 

Phase diffusion, 257 
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Phase noise, 260 
Phase shift, 328 
Phenomenological damping term, 355 
Phonon-maser, 167 
Phonon, acoustical, 377 
Phonon wave, optical, 372 
Photoabsorption, rate of, 69 
Photo-cells, 438 
Photodetection equation, 515-517, 526 
generalized, 525 
Photodetections, bunching of, 130 
Photodetector, 65 
Photoelectric effect, 1 
Photoelectrons, distribution, 513, 514, 
517 
Photomultiplier tube, 144, 438 
characteristics of, 144 
Photon absorption: 
by photodetector, 514 
probability, 46 
stimulated, 205 
Photon coincidence experiment, 110 
Photon counters, 84 
Photon counting distribution, 140, 141, 
149, 153, 513, 514, 518, 528 
effects of finite sampling time on, 152 
factorial moments of, 140, 142, 155, 
161, 513, 514, 518, 528 
of gas laser in transient operating 
conditions, 159 
Photon counting experiments, 115 
Photon density: 
equation for, 234, 236, 241 
as function of pumping, 233, 236 
Photon emission, 204 
stimulated, see Stimulated emission 
total rate of spontaneous, 204 
Photon flux density, 364 
Photon numbers, 231, 232, 266 
in a chaotic field, 120 
determination of, 215, 216 
distribution, 288, 517 
finding distribution, 123 
as function of pumping, 214 
in laser modes, 232 
mean, 503, 504 
mean, in steady state, 199 
Poisson distribution for, 86, 153, 503 
rate equation for, 226 
spectrum, 197 
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Photon packets, 116, 117, 118 

weight function, 120 

weighting to favour overlap, 123 
Photon states, transition rate into, 484 


Photon statistics, 127-176 
of laser fields, 152-159 
of light field, 5 
of mixed fields, 159-162 
of scattered laser light, 162-173 
of thermal fields, 146-152 
Photons, 1, 3, 115, 183 
distribution of absorbed, 513 
generation and annihilation rates, 203 
lifetimes, 207 
location to maximize entropy, 120 
multiple detection of, 143 
optical, 144 
probability distribution of emitted, 
503, 504, 505 
tendency to clamp in beam, 112 
Piezoelectric crystals, ionic, 382 
Planck density operator, 107 
Plane waves, 110 
development of pulse, 390 
modes, 48, 117 
Plasma waves, Hamiltonian density for, 
. 385 
Polariton wave, 382 
Polarizability tensor, 478 
bulk, 478 
Polarization, 219, 375 
atomic, 223 
change, resulting from radio-frequen- 
cy transition, 397, 398 
transfer by collision, 464 
Polarization index, 38 
Polarization cubic in the electric field 
amplitude, 365-369 
Polarizers, 440 
Population inversion, 4 
Positive frequency field, 81 
Positive frequency terms, 60 
Potentials and gauges, 22-25 
Poynting vector, 24 
Pressure broadening, 441-442 
Principle of detailed balance, 28 
Probability density, Fourier transform 
of, 97 
Probability distribution of complex 
numbers, 60 
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normalization, 61 
Projection operators, 293, 298 
applied to subsystems, 311 
rules for shifting, 304-307 
Pseudo-mode, 324 
Pulse approximation, 400 
Pulse excitation, 410 
Pulsed laser beams, 355 
Pump mechanism, 358 
time constants for, 428 
Pump rates, 208 
Pumping, 207, 210, 213, 227, 414-427 
apparatus for studying, 415 
with buffer gases, 423 
circulation of coherence in, 462 
description in terms of irreducible 
spherical tensor operators, 434 
detection by radioactive emission, 444 
exponential growth of signal, 428 
hyperfine, 424 
light sources for, see Light sources for 
optical pumping 
to measure fundamental physical 
quantities, 443 
from metastable state, 425 
to produce polarized system, 433 
refractive index as monitor of, 477-— 
480 
relaxation studies, 444 
signal in rubidium, 419 
via auxiliary levels, 225 
with white light, 436 
Pumping parameter, 197 


Q 


q-numbers, 62 
Q-spoiling, 211 
Quanta in a cubic wavelength, 54 
Quantization, I 
of electromagnetic field, 30-42 
Quantum-classical correspondence, 
191-197 
Quantum counter, 517 
Quantum detection, first development 
of, 56 
Quantum electrodynamics, 11-52 
field equations, 202, 212, 213 
matter equations, 202, 207-208, 213 
wave-photon dualism, 203 
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Quantum mechanics: 
equations of motion, 177-180, 246- 
248 


expectation values, calculating by 
c-number procedures, 299 
field equations, 246, 252 
mathematical difference from classi- 
cal mechanics, 191 
matter equations, 246, 253 
operators of, 13, 61 
reviewed, 13-15, 177-184 
Quantum noise, 4 
Quantum states, 63 
Quantum systems far from thermal equi- 
librium, 293-313 
Quantum theory, 62-65 
Quasi-linearization, quantum mechani- 
cal, 259 
Quasi-probability density, 94, 96, 97, 
101, 103 
normalization condition for, 102 
Quasi-probability distribution function, 
494 
Quasi-Stark effect, 211 
Quasi-thermal field, 150 
Quasi-thermal light, 149 
Quasi-thermal sources, 148 


R 


Radar meteorology, 128 
Radiation: 
degree of control of, 55 
from systems of N two-level atoms, 
501-506 
formal analogy with parametric 
amplifier, 501 
Radiation field, 180-184 
driven damped mode in cavity, 188 
eigenvalues and eigenvectors, 181-183 
energy in cavity, 181 
ground state, 63 
interaction with single bound elect- 
ron, 133 
quantization, 180 
quantum states of, 63 
statistical properties of optical, 127, 
177 
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Radiation gauge, 29 
condition, 23, 24 
potentials, 23 
vector potential, 31, 36 
Radio-frequency fields distinguished 
from optical fields, 446-447 


Radio-frequency interactions, 446-482 
Radio-frequency power broadening, 400 
Radio-frequency resonance, 417, 426 
effect of increasing r.f. power, 422 
in ground state of rubidium, 418 
at 3 kHz, 419 
effect of increasing power, 420 
monitoring, 417 
produced at 5 kHz by rotating field, 
422 
Raman effect, 6 
coherent, 75 
stimulated, 7, 366, 370-374, 382, 387 
Raman gain, 373 
coefficient, 370 
Raman scattering, 385 
ring pattern in induced, 7 
stimulated, 371, 490 
from polaritons, 382-384 
Raman susceptibility, 373 
Rate equations, 203-219 
derivation, 225, 226 
for photon numbers, 226 
for three-level system, 208~209 


Rate processes, 446 


Rayleigh scattering: 
inelastic, from anisotropic molecules, 
374-377 
stimulated, 367 
inelastic, 375, 376 
thermal, 379-382, 392 
antistokes shift, 381 
gain constant, 381 
transient effects in, 391 


Rayleigh spectrum of benzene, 166 
Recombination rates, 209 


Refractive index: 
intensity dependent, 366 
modulation of, 367 
as monitor of optical pumping, 477- 
480 


Relaxation oscillations, 210 
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Relaxation processes, 227, 417, 427-430 
in the dark, 428 
methods of study, 427 
semiclassical interpretation, 432-433 
transverse, 430 
with pumping light, 428 
Relaxation times, 423 
of *He atoms in ground states, 427 
Renewal process, 541 
Resonance: 
in excited states, 479 
multiple quantum, 421, 470, 471 
in metastable state, 426 
nuclear, 426 
radio-frequency, see Radio-frequency 
resonance 
Resonance curves, 406, 458 
demonstrating light shift, 431 
width of, 423 
Resonance displacement, 468, 469 
Resonance excitation, 430 
Resonance fluorescence, 396, 406 
in mercury, 397 
in potassium, 403 
in rubidium, 416 
in thallium, 408 
by two paths, 411 
Resonance functions, 458 
Resonant cavities, 423 
lossy, 323 
Resonant frequency, 370 
Resonator, quality factor of, 210 
Roof-mirror resonators, 345-351 
flat-roof, see Flat-roof resonator 
90° roof, 351 
power loss in, 351 
Rotating wave approximation, 222, 223 
Running waves, 360 


S 


s-ordered generating function, 531 
s-ordered products of operators, 98 
Statistical average of, 99 
s-ordering of field operators, 524-526, 
530 
Saturable absorption, 366 
Saturated inversion, 208 


Scalar field, spatial and temporal correl- 
ations of, 162 
Scalar potential, 22 
Scanning electrical filter, 144 
Scattered field, 163 
Scattering: 
as basis of detector, 72 
Compton, 49 
elastic, 48, 49 
of laser light by random medium, 162 
of radiation by electron in atom, 47 
differential cross-section for, 48 
Scattering vector, 163 
Schrédinger’s equation, 14, 20, 26, 27, 
177, 182, 502 
Schrédinger picture, 184, 485 
density operator in, 179, 188 
non-commuting operators in, 191 
transformation to interaction picture, 
185 
Schwarz inequality, 81 
Sealed-off sample cells, 435 
vapour pressure control in, 434 
wall coatings, 423, 425 
Second harmonic generation, 369 
in isotropic fluid, 368 
Self-focusing of light, 367, 377 
Self-induced transparency, 221, 356 
Semiclassical equations, 203, 224 | 
Semiclassical theory, 11, 51, 355 
of laser, 219-242 
dimensionless amplitudes, 223 | 
Sensitivity factor, 69 
Single-clipped autocorrelation of photon 
counting fluctuations: 
apparatus for measuring, 171 
determining translational diffusion 
coefficients of protein molecules 
by, 170, 172, 173 
Single counting statistics, 140-143 
Spatial correlations, 150 
Spectral data of field, 108, 110 
Spectral lines, self reversed, 436, 437 
Spectrum, see Electromagnetic spectrum 
Spheroidal coordinates, 335 
Spin exchange, 424 
Spin-flipping, 446 
Spin-orbit interaction, 386, 387 
Spin-wave Raman scattering, 386-387 
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Spontaneous emission, 44, 45, 503 
coherence in, 483-487 
a quantum mechanical process, 242 
treatment of, 447 
Wigner—Weisskopf theory of, 483, 484 
Spontaneous emission noise, 250, 260 
Spot size, 334, 336, 348 
State-vector, 15, 17, 178 
dyadic product of, 67 
Stimulated emission, 201, 204 
as basis for detector, 71 
mean photon number in, 504 
Stochastic processes, 61 
condition for stationarity, 73 
quantum-classical correspondence 
for, 177-200 
stationary, 72 
Stokes-antistokes coupling, 374 
Stokes frequency, 367, 370, 372 
Stokes light, spatial growth of, 388 
Subharmonic generation, 489 
Sum frequency generation in isotropic 
fluid, 362 
Superposition of coherent and thermal 
fields: 
diagonal representation function for, 
527 
multi-mode, 527-531 
normal generating function for, 527 
one-mode, 526, 527 
Susceptibility of medium, 490 
System operator, 178 
function, in Schrédinger picture, 184 


T 
Thermal fields: 
detection using heterodyne technique, 
$28 


one-mode, 526 

photon statistics of, 146-152 
Thermal-light generation, 146 
Thermal noise, 250 
Third harmonic generation, 366 

in isotropic fluid, 367 
Three-boson process, 7 


Threshold inversion, 281 
Time-dependence removal operator, 
449, 468 


567 


Time-displacement operator, 449, 450 

Total inversion, 205 

Total momentum operator, 41 

Total occupation difference, 255 

Total photon number operator, 34, 41 

Total spin operators, 255 

Transient response, 387-392 

Transients, 426 

Transition amplitude, 65 

Transition probabilities, 28, 45, 46, 48, 
66, 68, 143 

for induced absorption process, 29, 

30, 46 

Transition rates, 294 

Transmission parameter, 79 

Transversality condition, 59 

Two-photon absorption, 366 

Two-photon states, basic, 41 

Two-time average, 197 


U 


Ultra-distribution, 101 
Undamped oscillation, 262 
Unit operator, 89 
Unit polarization vector, 515 
Unitary displacement operator, 97 
Unloaded interferometer: 

frequency in, 223 

normalized eigenfunctions of, 222 
Unperturbed system, time dependence 

of, 27 

Unsaturated inversion, 207 


v 


Vacuum field, 355 

Vacuum fluctuations, 489 

Vacuum polarization, 51 

Vacuum state, 68 

application of creation and annihil- 

ation operators to, 63, 117 

Van der Pol equation, 507 

Van der Pol oscillator, quantum mech- 
anical, 259, 284 
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Van der Pol oscillator, rotating wave, 
197, 200 
fluctuation spectrum: 
above threshold, 200 
below threshold, 198 
Fokker—Planck equation for, 197, 287 
above threshold, 198-200 
below threshold, 198 
Variational techniques, 331 
Vector mode functions, 59, 62 
Vector potential, 22, 31, 36, 180 
von Neumann equation of motion of 
density matrix, 136 
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Wave analyser, 144 

Wave packet formalism, 535 

Wave packets, normalized state for, 119, 
see also Photon packets 

Wave-picture, 219 

Wave vector mismatch, 365 
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Wave vector relationship in stimulated 
Raman scattering of stokes and anti- 
stokes radiation, 371 
Wave vector representation, 37, 38 
Waves treated as scalar quantities, 327 
Weisskopf—Wigner theory, 5, 483, 484 
Wigner functions, 93, 94, 96, 270, 271 
establishing connection between quan- 
tum and classical variables, 269 
Fourier transforms of, 272 
properties of, 103 


¥ 


Young’s experiment, 79 


Z 


Zeeman resonance shift, 430 

Zero-point energy, 17, 19, 33 

Zero point vibrations, 64 

Zwanzig’s projector formalism, 508, 509 
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